Dataset statistics
| Number of variables | 42 |
|---|---|
| Number of observations | 199523 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 3229 |
| Duplicate rows (%) | 1.6% |
| Total size in memory | 413.3 MiB |
| Average record size in memory | 2.1 KiB |
Variable types
| CAT | 32 |
|---|---|
| NUM | 10 |
| Dataset has 3229 (1.6%) duplicate rows | Duplicates |
state_of_previous_residence has a high cardinality: 51 distinct values | High cardinality |
state_of_previous_residence is highly correlated with region_of_previous_residence | High correlation |
region_of_previous_residence is highly correlated with state_of_previous_residence | High correlation |
detailed_household_summary_in_household is highly correlated with detailed_household_and_family_stat | High correlation |
detailed_household_and_family_stat is highly correlated with detailed_household_summary_in_household | High correlation |
live_in_this_house_1_year_ago is highly correlated with migration_code-change_in_msa and 4 other fields | High correlation |
migration_code-change_in_msa is highly correlated with live_in_this_house_1_year_ago and 1 other fields | High correlation |
migration_code-change_in_reg is highly correlated with live_in_this_house_1_year_ago and 1 other fields | High correlation |
migration_code-move_within_reg is highly correlated with live_in_this_house_1_year_ago and 1 other fields | High correlation |
migration_prev_res_in_sunbelt is highly correlated with live_in_this_house_1_year_ago and 1 other fields | High correlation |
year is highly correlated with migration_code-change_in_msa and 4 other fields | High correlation |
dividends_from_stocks is highly skewed (γ1 = 27.78650179) | Skewed |
age has 2839 (1.4%) zeros | Zeros |
detailed_industry_recode has 100684 (50.5%) zeros | Zeros |
detailed_occupation_recode has 100684 (50.5%) zeros | Zeros |
wage_per_hour has 188219 (94.3%) zeros | Zeros |
capital_gains has 192144 (96.3%) zeros | Zeros |
capital_losses has 195617 (98.0%) zeros | Zeros |
dividends_from_stocks has 178382 (89.4%) zeros | Zeros |
num_persons_worked_for_employer has 95983 (48.1%) zeros | Zeros |
weeks_worked_in_year has 95983 (48.1%) zeros | Zeros |
Reproduction
| Analysis started | 2020-11-13 10:21:46.997330 |
|---|---|
| Analysis finished | 2020-11-13 10:23:59.947266 |
| Duration | 2 minutes and 12.95 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 91 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.49419866 |
|---|---|
| Minimum | 0 |
| Maximum | 90 |
| Zeros | 2839 |
| Zeros (%) | 1.4% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 15 |
| median | 33 |
| Q3 | 50 |
| 95-th percentile | 75 |
| Maximum | 90 |
| Range | 90 |
| Interquartile range (IQR) | 35 |
Descriptive statistics
| Standard deviation | 22.31089521 |
|---|---|
| Coefficient of variation (CV) | 0.6468013774 |
| Kurtosis | -0.7328243009 |
| Mean | 34.49419866 |
| Median Absolute Deviation (MAD) | 17 |
| Skewness | 0.3732904573 |
| Sum | 6882386 |
| Variance | 497.7760449 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 34 | 3489 | 1.7% | |
| 35 | 3450 | 1.7% | |
| 36 | 3353 | 1.7% | |
| 31 | 3351 | 1.7% | |
| 33 | 3340 | 1.7% | |
| 5 | 3332 | 1.7% | |
| 4 | 3318 | 1.7% | |
| 3 | 3279 | 1.6% | |
| 37 | 3278 | 1.6% | |
| 38 | 3277 | 1.6% | |
| 2 | 3236 | 1.6% | |
| 7 | 3218 | 1.6% | |
| 30 | 3203 | 1.6% | |
| 32 | 3188 | 1.6% | |
| 8 | 3187 | 1.6% | |
| 6 | 3171 | 1.6% | |
| 9 | 3162 | 1.6% | |
| 13 | 3152 | 1.6% | |
| 39 | 3144 | 1.6% | |
| 1 | 3138 | 1.6% | |
| 41 | 3134 | 1.6% | |
| 10 | 3134 | 1.6% | |
| 11 | 3128 | 1.6% | |
| 40 | 3114 | 1.6% | |
| 14 | 3068 | 1.5% | |
| Other values (66) | 118679 | 59.5% |
| Value | Count | Frequency (%) | |
| 0 | 2839 | 1.4% | |
| 1 | 3138 | 1.6% | |
| 2 | 3236 | 1.6% | |
| 3 | 3279 | 1.6% | |
| 4 | 3318 | 1.7% | |
| 5 | 3332 | 1.7% | |
| 6 | 3171 | 1.6% | |
| 7 | 3218 | 1.6% | |
| 8 | 3187 | 1.6% | |
| 9 | 3162 | 1.6% |
| Value | Count | Frequency (%) | |
| 90 | 725 | 0.4% | |
| 89 | 195 | 0.1% | |
| 88 | 241 | 0.1% | |
| 87 | 301 | 0.2% | |
| 86 | 348 | 0.2% | |
| 85 | 423 | 0.2% | |
| 84 | 519 | 0.3% | |
| 83 | 561 | 0.3% | |
| 82 | 615 | 0.3% | |
| 81 | 720 | 0.4% |
class_of_worker
Categorical
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe | |
|---|---|
| Private | |
| Self-employed-not incorporated | 8445 |
| Local government | 7784 |
| State government | 4227 |
| Other values (4) | 6794 |
| Value | Count | Frequency (%) | |
| Not in universe | 100245 | 50.2% | |
| Private | 72028 | 36.1% | |
| Self-employed-not incorporated | 8445 | 4.2% | |
| Local government | 7784 | 3.9% | |
| State government | 4227 | 2.1% | |
| Self-employed-incorporated | 3265 | 1.6% | |
| Federal government | 2925 | 1.5% | |
| Never worked | 439 | 0.2% | |
| Without pay | 165 | 0.1% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 30 |
|---|---|
| Median length | 15 |
| Mean length | 13.02115546 |
| Min length | 7 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 360624 | 13.9% | |
| i | 284393 | 10.9% | |
| n | 250517 | 9.6% | |
| 224475 | 8.6% | ||
| t | 216148 | 8.3% | |
| r | 214432 | 8.3% | |
| v | 187648 | 7.2% | |
| o | 167144 | 6.4% | |
| N | 100684 | 3.9% | |
| u | 100410 | 3.9% | |
| s | 100245 | 3.9% | |
| a | 98839 | 3.8% | |
| P | 72028 | 2.8% | |
| l | 34129 | 1.3% | |
| d | 26784 | 1.0% | |
| m | 26646 | 1.0% | |
| p | 23585 | 0.9% | |
| - | 23420 | 0.9% | |
| c | 19494 | 0.8% | |
| S | 15937 | 0.6% | |
| g | 14936 | 0.6% | |
| y | 11875 | 0.5% | |
| f | 11710 | 0.5% | |
| L | 7784 | 0.3% | |
| F | 2925 | 0.1% | |
| Other values (4) | 1208 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 2150602 | 82.8% | |
| Space Separator | 224475 | 8.6% | |
| Uppercase Letter | 199523 | 7.7% | |
| Dash Punctuation | 23420 | 0.9% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 100684 | 50.5% | |
| P | 72028 | 36.1% | |
| S | 15937 | 8.0% | |
| L | 7784 | 3.9% | |
| F | 2925 | 1.5% | |
| W | 165 | 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 360624 | 16.8% | |
| i | 284393 | 13.2% | |
| n | 250517 | 11.6% | |
| t | 216148 | 10.1% | |
| r | 214432 | 10.0% | |
| v | 187648 | 8.7% | |
| o | 167144 | 7.8% | |
| u | 100410 | 4.7% | |
| s | 100245 | 4.7% | |
| a | 98839 | 4.6% | |
| l | 34129 | 1.6% | |
| d | 26784 | 1.2% | |
| m | 26646 | 1.2% | |
| p | 23585 | 1.1% | |
| c | 19494 | 0.9% | |
| g | 14936 | 0.7% | |
| y | 11875 | 0.6% | |
| f | 11710 | 0.5% | |
| w | 439 | < 0.1% | |
| k | 439 | < 0.1% | |
| h | 165 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 224475 | 100.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 23420 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 2350125 | 90.5% | |
| Common | 247895 | 9.5% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 360624 | 15.3% | |
| i | 284393 | 12.1% | |
| n | 250517 | 10.7% | |
| t | 216148 | 9.2% | |
| r | 214432 | 9.1% | |
| v | 187648 | 8.0% | |
| o | 167144 | 7.1% | |
| N | 100684 | 4.3% | |
| u | 100410 | 4.3% | |
| s | 100245 | 4.3% | |
| a | 98839 | 4.2% | |
| P | 72028 | 3.1% | |
| l | 34129 | 1.5% | |
| d | 26784 | 1.1% | |
| m | 26646 | 1.1% | |
| p | 23585 | 1.0% | |
| c | 19494 | 0.8% | |
| S | 15937 | 0.7% | |
| g | 14936 | 0.6% | |
| y | 11875 | 0.5% | |
| f | 11710 | 0.5% | |
| L | 7784 | 0.3% | |
| F | 2925 | 0.1% | |
| w | 439 | < 0.1% | |
| k | 439 | < 0.1% | |
| Other values (2) | 330 | < 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 224475 | 90.6% | ||
| - | 23420 | 9.4% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 2598020 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 360624 | 13.9% | |
| i | 284393 | 10.9% | |
| n | 250517 | 9.6% | |
| 224475 | 8.6% | ||
| t | 216148 | 8.3% | |
| r | 214432 | 8.3% | |
| v | 187648 | 7.2% | |
| o | 167144 | 6.4% | |
| N | 100684 | 3.9% | |
| u | 100410 | 3.9% | |
| s | 100245 | 3.9% | |
| a | 98839 | 3.8% | |
| P | 72028 | 2.8% | |
| l | 34129 | 1.3% | |
| d | 26784 | 1.0% | |
| m | 26646 | 1.0% | |
| p | 23585 | 0.9% | |
| - | 23420 | 0.9% | |
| c | 19494 | 0.8% | |
| S | 15937 | 0.6% | |
| g | 14936 | 0.6% | |
| y | 11875 | 0.5% | |
| f | 11710 | 0.5% | |
| L | 7784 | 0.3% | |
| F | 2925 | 0.1% | |
| Other values (4) | 1208 | < 0.1% |
| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.35232028 |
|---|---|
| Minimum | 0 |
| Maximum | 51 |
| Zeros | 100684 |
| Zeros (%) | 50.5% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 33 |
| 95-th percentile | 44 |
| Maximum | 51 |
| Range | 51 |
| Interquartile range (IQR) | 33 |
Descriptive statistics
| Standard deviation | 18.0671288 |
|---|---|
| Coefficient of variation (CV) | 1.17683376 |
| Kurtosis | -1.501107921 |
| Mean | 15.35232028 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.5166876791 |
| Sum | 3063141 |
| Variance | 326.421143 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 100684 | 50.5% | |
| 33 | 17070 | 8.6% | |
| 43 | 8283 | 4.2% | |
| 4 | 5984 | 3.0% | |
| 42 | 4683 | 2.3% | |
| 45 | 4482 | 2.2% | |
| 29 | 4209 | 2.1% | |
| 37 | 4022 | 2.0% | |
| 41 | 3964 | 2.0% | |
| 32 | 3596 | 1.8% | |
| 35 | 3380 | 1.7% | |
| 39 | 2937 | 1.5% | |
| 34 | 2765 | 1.4% | |
| 44 | 2549 | 1.3% | |
| 2 | 2196 | 1.1% | |
| 11 | 1764 | 0.9% | |
| 50 | 1704 | 0.9% | |
| 40 | 1651 | 0.8% | |
| 47 | 1644 | 0.8% | |
| 38 | 1629 | 0.8% | |
| 24 | 1503 | 0.8% | |
| 12 | 1350 | 0.7% | |
| 19 | 1346 | 0.7% | |
| 30 | 1181 | 0.6% | |
| 31 | 1178 | 0.6% | |
| Other values (27) | 13769 | 6.9% |
| Value | Count | Frequency (%) | |
| 0 | 100684 | 50.5% | |
| 1 | 827 | 0.4% | |
| 2 | 2196 | 1.1% | |
| 3 | 563 | 0.3% | |
| 4 | 5984 | 3.0% | |
| 5 | 553 | 0.3% | |
| 6 | 554 | 0.3% | |
| 7 | 422 | 0.2% | |
| 8 | 550 | 0.3% | |
| 9 | 993 | 0.5% |
| Value | Count | Frequency (%) | |
| 51 | 36 | < 0.1% | |
| 50 | 1704 | 0.9% | |
| 49 | 610 | 0.3% | |
| 48 | 652 | 0.3% | |
| 47 | 1644 | 0.8% | |
| 46 | 187 | 0.1% | |
| 45 | 4482 | 2.2% | |
| 44 | 2549 | 1.3% | |
| 43 | 8283 | 4.2% | |
| 42 | 4683 | 2.3% |
| Distinct | 47 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.30655614 |
|---|---|
| Minimum | 0 |
| Maximum | 46 |
| Zeros | 100684 |
| Zeros (%) | 50.5% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 26 |
| 95-th percentile | 38 |
| Maximum | 46 |
| Range | 46 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 14.45420392 |
|---|---|
| Coefficient of variation (CV) | 1.278391381 |
| Kurtosis | -0.8965333655 |
| Mean | 11.30655614 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.829238138 |
| Sum | 2255918 |
| Variance | 208.9240109 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 100684 | 50.5% | |
| 2 | 8756 | 4.4% | |
| 26 | 7887 | 4.0% | |
| 19 | 5413 | 2.7% | |
| 29 | 5105 | 2.6% | |
| 36 | 4145 | 2.1% | |
| 34 | 4025 | 2.0% | |
| 10 | 3683 | 1.8% | |
| 16 | 3445 | 1.7% | |
| 23 | 3392 | 1.7% | |
| 12 | 3340 | 1.7% | |
| 33 | 3325 | 1.7% | |
| 3 | 3195 | 1.6% | |
| 35 | 3168 | 1.6% | |
| 38 | 3003 | 1.5% | |
| 31 | 2699 | 1.4% | |
| 32 | 2398 | 1.2% | |
| 37 | 2234 | 1.1% | |
| 8 | 2151 | 1.1% | |
| 42 | 1918 | 1.0% | |
| 30 | 1897 | 1.0% | |
| 24 | 1847 | 0.9% | |
| 17 | 1771 | 0.9% | |
| 28 | 1661 | 0.8% | |
| 44 | 1592 | 0.8% | |
| Other values (22) | 16789 | 8.4% |
| Value | Count | Frequency (%) | |
| 0 | 100684 | 50.5% | |
| 1 | 544 | 0.3% | |
| 2 | 8756 | 4.4% | |
| 3 | 3195 | 1.6% | |
| 4 | 1364 | 0.7% | |
| 5 | 855 | 0.4% | |
| 6 | 441 | 0.2% | |
| 7 | 731 | 0.4% | |
| 8 | 2151 | 1.1% | |
| 9 | 738 | 0.4% |
| Value | Count | Frequency (%) | |
| 46 | 36 | < 0.1% | |
| 45 | 172 | 0.1% | |
| 44 | 1592 | 0.8% | |
| 43 | 1382 | 0.7% | |
| 42 | 1918 | 1.0% | |
| 41 | 1592 | 0.8% | |
| 40 | 617 | 0.3% | |
| 39 | 1017 | 0.5% | |
| 38 | 3003 | 1.5% | |
| 37 | 2234 | 1.1% |
education
Categorical
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| High school graduate | |
|---|---|
| Children | |
| Some college but no degree | |
| Bachelors degree(BA AB BS) | |
| 7th and 8th grade | |
| Other values (12) |
| Value | Count | Frequency (%) | |
| High school graduate | 48407 | 24.3% | |
| Children | 47422 | 23.8% | |
| Some college but no degree | 27820 | 13.9% | |
| Bachelors degree(BA AB BS) | 19865 | 10.0% | |
| 7th and 8th grade | 8007 | 4.0% | |
| 10th grade | 7557 | 3.8% | |
| 11th grade | 6876 | 3.4% | |
| Masters degree(MA MS MEng MEd MSW MBA) | 6541 | 3.3% | |
| 9th grade | 6230 | 3.1% | |
| Associates degree-occup /vocational | 5358 | 2.7% | |
| Associates degree-academic program | 4363 | 2.2% | |
| 5th or 6th grade | 3277 | 1.6% | |
| 12th grade no diploma | 2126 | 1.1% | |
| 1st 2nd 3rd or 4th grade | 1799 | 0.9% | |
| Prof school degree (MD DDS DVM LLB JD) | 1793 | 0.9% | |
| Doctorate degree(PhD EdD) | 1263 | 0.6% | |
| Less than 1st grade | 819 | 0.4% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 38 |
|---|---|
| Median length | 20 |
| Mean length | 18.86398561 |
| Min length | 8 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 459561 | 12.2% | |
| 413799 | 11.0% | ||
| o | 247530 | 6.6% | |
| r | 244586 | 6.5% | |
| g | 239232 | 6.4% | |
| d | 225421 | 6.0% | |
| h | 215132 | 5.7% | |
| a | 205652 | 5.5% | |
| l | 180611 | 4.8% | |
| t | 150966 | 4.0% | |
| c | 133669 | 3.6% | |
| i | 117397 | 3.1% | |
| s | 116566 | 3.1% | |
| n | 99892 | 2.7% | |
| B | 87794 | 2.3% | |
| u | 81585 | 2.2% | |
| S | 62560 | 1.7% | |
| A | 62533 | 1.7% | |
| M | 49373 | 1.3% | |
| H | 48407 | 1.3% | |
| C | 47422 | 1.3% | |
| m | 38672 | 1.0% | |
| ( | 29462 | 0.8% | |
| ) | 29462 | 0.8% | |
| b | 27820 | 0.7% | |
| Other values (22) | 148695 | 4.0% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 2803290 | 74.5% | |
| Space Separator | 413799 | 11.0% | |
| Uppercase Letter | 402776 | 10.7% | |
| Decimal Number | 69931 | 1.9% | |
| Open Punctuation | 29462 | 0.8% | |
| Close Punctuation | 29462 | 0.8% | |
| Dash Punctuation | 9721 | 0.3% | |
| Other Punctuation | 5358 | 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| B | 87794 | 21.8% | |
| S | 62560 | 15.5% | |
| A | 62533 | 15.5% | |
| M | 49373 | 12.3% | |
| H | 48407 | 12.0% | |
| C | 47422 | 11.8% | |
| E | 14345 | 3.6% | |
| D | 12754 | 3.2% | |
| W | 6541 | 1.6% | |
| L | 4405 | 1.1% | |
| P | 3056 | 0.8% | |
| V | 1793 | 0.4% | |
| J | 1793 | 0.4% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 459561 | 16.4% | |
| o | 247530 | 8.8% | |
| r | 244586 | 8.7% | |
| g | 239232 | 8.5% | |
| d | 225421 | 8.0% | |
| h | 215132 | 7.7% | |
| a | 205652 | 7.3% | |
| l | 180611 | 6.4% | |
| t | 150966 | 5.4% | |
| c | 133669 | 4.8% | |
| i | 117397 | 4.2% | |
| s | 116566 | 4.2% | |
| n | 99892 | 3.6% | |
| u | 81585 | 2.9% | |
| m | 38672 | 1.4% | |
| b | 27820 | 1.0% | |
| p | 11847 | 0.4% | |
| v | 5358 | 0.2% | |
| f | 1793 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 413799 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 26053 | 37.3% | |
| 7 | 8007 | 11.4% | |
| 8 | 8007 | 11.4% | |
| 0 | 7557 | 10.8% | |
| 9 | 6230 | 8.9% | |
| 2 | 3925 | 5.6% | |
| 5 | 3277 | 4.7% | |
| 6 | 3277 | 4.7% | |
| 3 | 1799 | 2.6% | |
| 4 | 1799 | 2.6% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 29462 | 100.0% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 29462 | 100.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 9721 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| / | 5358 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 3206066 | 85.2% | |
| Common | 557733 | 14.8% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 459561 | 14.3% | |
| o | 247530 | 7.7% | |
| r | 244586 | 7.6% | |
| g | 239232 | 7.5% | |
| d | 225421 | 7.0% | |
| h | 215132 | 6.7% | |
| a | 205652 | 6.4% | |
| l | 180611 | 5.6% | |
| t | 150966 | 4.7% | |
| c | 133669 | 4.2% | |
| i | 117397 | 3.7% | |
| s | 116566 | 3.6% | |
| n | 99892 | 3.1% | |
| B | 87794 | 2.7% | |
| u | 81585 | 2.5% | |
| S | 62560 | 2.0% | |
| A | 62533 | 2.0% | |
| M | 49373 | 1.5% | |
| H | 48407 | 1.5% | |
| C | 47422 | 1.5% | |
| m | 38672 | 1.2% | |
| b | 27820 | 0.9% | |
| E | 14345 | 0.4% | |
| D | 12754 | 0.4% | |
| p | 11847 | 0.4% | |
| Other values (7) | 24739 | 0.8% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 413799 | 74.2% | ||
| ( | 29462 | 5.3% | |
| ) | 29462 | 5.3% | |
| 1 | 26053 | 4.7% | |
| - | 9721 | 1.7% | |
| 7 | 8007 | 1.4% | |
| 8 | 8007 | 1.4% | |
| 0 | 7557 | 1.4% | |
| 9 | 6230 | 1.1% | |
| / | 5358 | 1.0% | |
| 2 | 3925 | 0.7% | |
| 5 | 3277 | 0.6% | |
| 6 | 3277 | 0.6% | |
| 3 | 1799 | 0.3% | |
| 4 | 1799 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 3763799 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 459561 | 12.2% | |
| 413799 | 11.0% | ||
| o | 247530 | 6.6% | |
| r | 244586 | 6.5% | |
| g | 239232 | 6.4% | |
| d | 225421 | 6.0% | |
| h | 215132 | 5.7% | |
| a | 205652 | 5.5% | |
| l | 180611 | 4.8% | |
| t | 150966 | 4.0% | |
| c | 133669 | 3.6% | |
| i | 117397 | 3.1% | |
| s | 116566 | 3.1% | |
| n | 99892 | 2.7% | |
| B | 87794 | 2.3% | |
| u | 81585 | 2.2% | |
| S | 62560 | 1.7% | |
| A | 62533 | 1.7% | |
| M | 49373 | 1.3% | |
| H | 48407 | 1.3% | |
| C | 47422 | 1.3% | |
| m | 38672 | 1.0% | |
| ( | 29462 | 0.8% | |
| ) | 29462 | 0.8% | |
| b | 27820 | 0.7% | |
| Other values (22) | 148695 | 4.0% |
| Distinct | 1240 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 55.42690818 |
|---|---|
| Minimum | 0 |
| Maximum | 9999 |
| Zeros | 188219 |
| Zeros (%) | 94.3% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 495 |
| Maximum | 9999 |
| Range | 9999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 274.8964539 |
|---|---|
| Coefficient of variation (CV) | 4.959620931 |
| Kurtosis | 155.2188969 |
| Mean | 55.42690818 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.935096531 |
| Sum | 11058943 |
| Variance | 75568.06037 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 188219 | 94.3% | |
| 500 | 734 | 0.4% | |
| 600 | 546 | 0.3% | |
| 700 | 534 | 0.3% | |
| 800 | 507 | 0.3% | |
| 1000 | 386 | 0.2% | |
| 425 | 376 | 0.2% | |
| 900 | 336 | 0.2% | |
| 550 | 280 | 0.1% | |
| 1200 | 256 | 0.1% | |
| 1100 | 235 | 0.1% | |
| 650 | 229 | 0.1% | |
| 450 | 222 | 0.1% | |
| 1500 | 221 | 0.1% | |
| 750 | 202 | 0.1% | |
| 1300 | 198 | 0.1% | |
| 850 | 167 | 0.1% | |
| 525 | 147 | 0.1% | |
| 1600 | 136 | 0.1% | |
| 1400 | 132 | 0.1% | |
| 1800 | 127 | 0.1% | |
| 400 | 125 | 0.1% | |
| 1700 | 116 | 0.1% | |
| 2000 | 108 | 0.1% | |
| 475 | 105 | 0.1% | |
| Other values (1215) | 4879 | 2.4% |
| Value | Count | Frequency (%) | |
| 0 | 188219 | 94.3% | |
| 20 | 1 | < 0.1% | |
| 70 | 1 | < 0.1% | |
| 75 | 2 | < 0.1% | |
| 100 | 11 | < 0.1% | |
| 110 | 1 | < 0.1% | |
| 125 | 1 | < 0.1% | |
| 135 | 1 | < 0.1% | |
| 143 | 1 | < 0.1% | |
| 150 | 6 | < 0.1% |
| Value | Count | Frequency (%) | |
| 9999 | 1 | < 0.1% | |
| 9916 | 1 | < 0.1% | |
| 9800 | 2 | < 0.1% | |
| 9400 | 2 | < 0.1% | |
| 9000 | 1 | < 0.1% | |
| 8800 | 1 | < 0.1% | |
| 8600 | 1 | < 0.1% | |
| 8500 | 1 | < 0.1% | |
| 8300 | 1 | < 0.1% | |
| 8000 | 4 | < 0.1% |
enroll_in_edu_inst_last_wk
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe | |
|---|---|
| High school | 6892 |
| College or university | 5688 |
| Value | Count | Frequency (%) | |
| Not in universe | 186943 | 93.7% | |
| High school | 6892 | 3.5% | |
| College or university | 5688 | 2.9% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 21 |
|---|---|
| Median length | 15 |
| Mean length | 15.03287842 |
| Min length | 11 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 392154 | 13.1% | ||
| i | 392154 | 13.1% | |
| e | 390950 | 13.0% | |
| n | 379574 | 12.7% | |
| o | 212103 | 7.1% | |
| s | 199523 | 6.7% | |
| r | 198319 | 6.6% | |
| t | 192631 | 6.4% | |
| u | 192631 | 6.4% | |
| v | 192631 | 6.4% | |
| N | 186943 | 6.2% | |
| l | 18268 | 0.6% | |
| h | 13784 | 0.5% | |
| g | 12580 | 0.4% | |
| H | 6892 | 0.2% | |
| c | 6892 | 0.2% | |
| C | 5688 | 0.2% | |
| y | 5688 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 2407728 | 80.3% | |
| Space Separator | 392154 | 13.1% | |
| Uppercase Letter | 199523 | 6.7% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 186943 | 93.7% | |
| H | 6892 | 3.5% | |
| C | 5688 | 2.9% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| i | 392154 | 16.3% | |
| e | 390950 | 16.2% | |
| n | 379574 | 15.8% | |
| o | 212103 | 8.8% | |
| s | 199523 | 8.3% | |
| r | 198319 | 8.2% | |
| t | 192631 | 8.0% | |
| u | 192631 | 8.0% | |
| v | 192631 | 8.0% | |
| l | 18268 | 0.8% | |
| h | 13784 | 0.6% | |
| g | 12580 | 0.5% | |
| c | 6892 | 0.3% | |
| y | 5688 | 0.2% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 392154 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 2607251 | 86.9% | |
| Common | 392154 | 13.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| i | 392154 | 15.0% | |
| e | 390950 | 15.0% | |
| n | 379574 | 14.6% | |
| o | 212103 | 8.1% | |
| s | 199523 | 7.7% | |
| r | 198319 | 7.6% | |
| t | 192631 | 7.4% | |
| u | 192631 | 7.4% | |
| v | 192631 | 7.4% | |
| N | 186943 | 7.2% | |
| l | 18268 | 0.7% | |
| h | 13784 | 0.5% | |
| g | 12580 | 0.5% | |
| H | 6892 | 0.3% | |
| c | 6892 | 0.3% | |
| C | 5688 | 0.2% | |
| y | 5688 | 0.2% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 392154 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 2999405 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 392154 | 13.1% | ||
| i | 392154 | 13.1% | |
| e | 390950 | 13.0% | |
| n | 379574 | 12.7% | |
| o | 212103 | 7.1% | |
| s | 199523 | 6.7% | |
| r | 198319 | 6.6% | |
| t | 192631 | 6.4% | |
| u | 192631 | 6.4% | |
| v | 192631 | 6.4% | |
| N | 186943 | 6.2% | |
| l | 18268 | 0.6% | |
| h | 13784 | 0.5% | |
| g | 12580 | 0.4% | |
| H | 6892 | 0.2% | |
| c | 6892 | 0.2% | |
| C | 5688 | 0.2% | |
| y | 5688 | 0.2% |
marital_stat
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Never married | |
|---|---|
| Married-civilian spouse present | |
| Divorced | |
| Widowed | |
| Separated | 3460 |
| Other values (2) | 2183 |
| Value | Count | Frequency (%) | |
| Never married | 86485 | 43.3% | |
| Married-civilian spouse present | 84222 | 42.2% | |
| Divorced | 12710 | 6.4% | |
| Widowed | 10463 | 5.2% | |
| Separated | 3460 | 1.7% | |
| Married-spouse absent | 1518 | 0.8% | |
| Married-A F spouse present | 665 | 0.3% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 31 |
|---|---|
| Median length | 13 |
| Mean length | 19.99977947 |
| Min length | 7 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 633650 | 15.9% | |
| r | 533322 | 13.4% | |
| i | 448729 | 11.2% | |
| a | 265550 | 6.7% | |
| s | 259215 | 6.5% | |
| 258442 | 6.5% | ||
| d | 209986 | 5.3% | |
| v | 183417 | 4.6% | |
| p | 174752 | 4.4% | |
| n | 170627 | 4.3% | |
| o | 109578 | 2.7% | |
| c | 96932 | 2.4% | |
| t | 89865 | 2.3% | |
| N | 86485 | 2.2% | |
| m | 86485 | 2.2% | |
| M | 86405 | 2.2% | |
| - | 86405 | 2.2% | |
| u | 86405 | 2.2% | |
| l | 84222 | 2.1% | |
| D | 12710 | 0.3% | |
| W | 10463 | 0.3% | |
| w | 10463 | 0.3% | |
| S | 3460 | 0.1% | |
| b | 1518 | < 0.1% | |
| A | 665 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 3444716 | 86.3% | |
| Space Separator | 258442 | 6.5% | |
| Uppercase Letter | 200853 | 5.0% | |
| Dash Punctuation | 86405 | 2.2% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 86485 | 43.1% | |
| M | 86405 | 43.0% | |
| D | 12710 | 6.3% | |
| W | 10463 | 5.2% | |
| S | 3460 | 1.7% | |
| A | 665 | 0.3% | |
| F | 665 | 0.3% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 633650 | 18.4% | |
| r | 533322 | 15.5% | |
| i | 448729 | 13.0% | |
| a | 265550 | 7.7% | |
| s | 259215 | 7.5% | |
| d | 209986 | 6.1% | |
| v | 183417 | 5.3% | |
| p | 174752 | 5.1% | |
| n | 170627 | 5.0% | |
| o | 109578 | 3.2% | |
| c | 96932 | 2.8% | |
| t | 89865 | 2.6% | |
| m | 86485 | 2.5% | |
| u | 86405 | 2.5% | |
| l | 84222 | 2.4% | |
| w | 10463 | 0.3% | |
| b | 1518 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 258442 | 100.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 86405 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 3645569 | 91.4% | |
| Common | 344847 | 8.6% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 633650 | 17.4% | |
| r | 533322 | 14.6% | |
| i | 448729 | 12.3% | |
| a | 265550 | 7.3% | |
| s | 259215 | 7.1% | |
| d | 209986 | 5.8% | |
| v | 183417 | 5.0% | |
| p | 174752 | 4.8% | |
| n | 170627 | 4.7% | |
| o | 109578 | 3.0% | |
| c | 96932 | 2.7% | |
| t | 89865 | 2.5% | |
| N | 86485 | 2.4% | |
| m | 86485 | 2.4% | |
| M | 86405 | 2.4% | |
| u | 86405 | 2.4% | |
| l | 84222 | 2.3% | |
| D | 12710 | 0.3% | |
| W | 10463 | 0.3% | |
| w | 10463 | 0.3% | |
| S | 3460 | 0.1% | |
| b | 1518 | < 0.1% | |
| A | 665 | < 0.1% | |
| F | 665 | < 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 258442 | 74.9% | ||
| - | 86405 | 25.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 3990416 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 633650 | 15.9% | |
| r | 533322 | 13.4% | |
| i | 448729 | 11.2% | |
| a | 265550 | 6.7% | |
| s | 259215 | 6.5% | |
| 258442 | 6.5% | ||
| d | 209986 | 5.3% | |
| v | 183417 | 4.6% | |
| p | 174752 | 4.4% | |
| n | 170627 | 4.3% | |
| o | 109578 | 2.7% | |
| c | 96932 | 2.4% | |
| t | 89865 | 2.3% | |
| N | 86485 | 2.2% | |
| m | 86485 | 2.2% | |
| M | 86405 | 2.2% | |
| - | 86405 | 2.2% | |
| u | 86405 | 2.2% | |
| l | 84222 | 2.1% | |
| D | 12710 | 0.3% | |
| W | 10463 | 0.3% | |
| w | 10463 | 0.3% | |
| S | 3460 | 0.1% | |
| b | 1518 | < 0.1% | |
| A | 665 | < 0.1% |
major_industry_code
Categorical
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe or children | |
|---|---|
| Retail trade | |
| Manufacturing-durable goods | 9015 |
| Education | 8283 |
| Manufacturing-nondurable goods | 6897 |
| Other values (19) |
| Value | Count | Frequency (%) | |
| Not in universe or children | 100684 | 50.5% | |
| Retail trade | 17070 | 8.6% | |
| Manufacturing-durable goods | 9015 | 4.5% | |
| Education | 8283 | 4.2% | |
| Manufacturing-nondurable goods | 6897 | 3.5% | |
| Finance insurance and real estate | 6145 | 3.1% | |
| Construction | 5984 | 3.0% | |
| Business and repair services | 5651 | 2.8% | |
| Medical except hospital | 4683 | 2.3% | |
| Public administration | 4610 | 2.3% | |
| Other professional services | 4482 | 2.2% | |
| Transportation | 4209 | 2.1% | |
| Hospital services | 3964 | 2.0% | |
| Wholesale trade | 3596 | 1.8% | |
| Agriculture | 3023 | 1.5% | |
| Personal services except private HH | 2937 | 1.5% | |
| Social services | 2549 | 1.3% | |
| Entertainment | 1651 | 0.8% | |
| Communications | 1181 | 0.6% | |
| Utilities and sanitary services | 1178 | 0.6% | |
| Private household services | 945 | 0.5% | |
| Mining | 563 | 0.3% | |
| Forestry and fisheries | 187 | 0.1% | |
| Armed Forces | 36 | < 0.1% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 35 |
|---|---|
| Median length | 27 |
| Mean length | 23.39614982 |
| Min length | 6 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 527882 | 11.3% | ||
| e | 493118 | 10.6% | |
| i | 454739 | 9.7% | |
| n | 445989 | 9.6% | |
| r | 444143 | 9.5% | |
| o | 304536 | 6.5% | |
| t | 242020 | 5.2% | |
| s | 233277 | 5.0% | |
| a | 190749 | 4.1% | |
| c | 188561 | 4.0% | |
| u | 187265 | 4.0% | |
| d | 184892 | 4.0% | |
| l | 180057 | 3.9% | |
| v | 126272 | 2.7% | |
| h | 115522 | 2.5% | |
| N | 100684 | 2.2% | |
| g | 35410 | 0.8% | |
| p | 33546 | 0.7% | |
| M | 21158 | 0.5% | |
| f | 20581 | 0.4% | |
| b | 20522 | 0.4% | |
| R | 17070 | 0.4% | |
| - | 15912 | 0.3% | |
| E | 9934 | 0.2% | |
| H | 9838 | 0.2% | |
| Other values (13) | 64393 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 3918843 | 83.9% | |
| Space Separator | 527882 | 11.3% | |
| Uppercase Letter | 205433 | 4.4% | |
| Dash Punctuation | 15912 | 0.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 100684 | 49.0% | |
| M | 21158 | 10.3% | |
| R | 17070 | 8.3% | |
| E | 9934 | 4.8% | |
| H | 9838 | 4.8% | |
| P | 8492 | 4.1% | |
| C | 7165 | 3.5% | |
| F | 6368 | 3.1% | |
| B | 5651 | 2.8% | |
| O | 4482 | 2.2% | |
| T | 4209 | 2.0% | |
| W | 3596 | 1.8% | |
| A | 3059 | 1.5% | |
| S | 2549 | 1.2% | |
| U | 1178 | 0.6% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 493118 | 12.6% | |
| i | 454739 | 11.6% | |
| n | 445989 | 11.4% | |
| r | 444143 | 11.3% | |
| o | 304536 | 7.8% | |
| t | 242020 | 6.2% | |
| s | 233277 | 6.0% | |
| a | 190749 | 4.9% | |
| c | 188561 | 4.8% | |
| u | 187265 | 4.8% | |
| d | 184892 | 4.7% | |
| l | 180057 | 4.6% | |
| v | 126272 | 3.2% | |
| h | 115522 | 2.9% | |
| g | 35410 | 0.9% | |
| p | 33546 | 0.9% | |
| f | 20581 | 0.5% | |
| b | 20522 | 0.5% | |
| m | 8659 | 0.2% | |
| x | 7620 | 0.2% | |
| y | 1365 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 527882 | 100.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 15912 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 4124276 | 88.4% | |
| Common | 543794 | 11.6% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 493118 | 12.0% | |
| i | 454739 | 11.0% | |
| n | 445989 | 10.8% | |
| r | 444143 | 10.8% | |
| o | 304536 | 7.4% | |
| t | 242020 | 5.9% | |
| s | 233277 | 5.7% | |
| a | 190749 | 4.6% | |
| c | 188561 | 4.6% | |
| u | 187265 | 4.5% | |
| d | 184892 | 4.5% | |
| l | 180057 | 4.4% | |
| v | 126272 | 3.1% | |
| h | 115522 | 2.8% | |
| N | 100684 | 2.4% | |
| g | 35410 | 0.9% | |
| p | 33546 | 0.8% | |
| M | 21158 | 0.5% | |
| f | 20581 | 0.5% | |
| b | 20522 | 0.5% | |
| R | 17070 | 0.4% | |
| E | 9934 | 0.2% | |
| H | 9838 | 0.2% | |
| m | 8659 | 0.2% | |
| P | 8492 | 0.2% | |
| Other values (11) | 47242 | 1.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 527882 | 97.1% | ||
| - | 15912 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 4668070 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 527882 | 11.3% | ||
| e | 493118 | 10.6% | |
| i | 454739 | 9.7% | |
| n | 445989 | 9.6% | |
| r | 444143 | 9.5% | |
| o | 304536 | 6.5% | |
| t | 242020 | 5.2% | |
| s | 233277 | 5.0% | |
| a | 190749 | 4.1% | |
| c | 188561 | 4.0% | |
| u | 187265 | 4.0% | |
| d | 184892 | 4.0% | |
| l | 180057 | 3.9% | |
| v | 126272 | 2.7% | |
| h | 115522 | 2.5% | |
| N | 100684 | 2.2% | |
| g | 35410 | 0.8% | |
| p | 33546 | 0.7% | |
| M | 21158 | 0.5% | |
| f | 20581 | 0.4% | |
| b | 20522 | 0.4% | |
| R | 17070 | 0.4% | |
| - | 15912 | 0.3% | |
| E | 9934 | 0.2% | |
| H | 9838 | 0.2% | |
| Other values (13) | 64393 | 1.4% |
major_occupation_code
Categorical
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe | |
|---|---|
| Adm support including clerical | |
| Professional specialty | |
| Executive admin and managerial | |
| Other service | |
| Other values (10) |
| Value | Count | Frequency (%) | |
| Not in universe | 100684 | 50.5% | |
| Adm support including clerical | 14837 | 7.4% | |
| Professional specialty | 13940 | 7.0% | |
| Executive admin and managerial | 12495 | 6.3% | |
| Other service | 12099 | 6.1% | |
| Sales | 11783 | 5.9% | |
| Precision production craft & repair | 10518 | 5.3% | |
| Machine operators assmblrs & inspctrs | 6379 | 3.2% | |
| Handlers equip cleaners etc | 4127 | 2.1% | |
| Transportation and material moving | 4020 | 2.0% | |
| Farming forestry and fishing | 3146 | 1.6% | |
| Technicians and related support | 3018 | 1.5% | |
| Protective services | 1661 | 0.8% | |
| Private household services | 780 | 0.4% | |
| Armed Forces | 36 | < 0.1% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 37 |
|---|---|
| Median length | 15 |
| Mean length | 19.74349323 |
| Min length | 5 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 423181 | 10.7% | ||
| i | 414716 | 10.5% | |
| e | 410135 | 10.4% | |
| n | 359087 | 9.1% | |
| r | 299839 | 7.6% | |
| s | 260315 | 6.6% | |
| t | 217320 | 5.5% | |
| o | 209194 | 5.3% | |
| a | 201628 | 5.1% | |
| u | 161296 | 4.1% | |
| c | 145785 | 3.7% | |
| v | 134180 | 3.4% | |
| l | 119120 | 3.0% | |
| N | 100684 | 2.6% | |
| p | 91591 | 2.3% | |
| d | 83327 | 2.1% | |
| m | 57428 | 1.5% | |
| g | 37644 | 1.0% | |
| f | 30750 | 0.8% | |
| P | 26899 | 0.7% | |
| h | 26202 | 0.7% | |
| y | 17086 | 0.4% | |
| & | 16897 | 0.4% | |
| A | 14873 | 0.4% | |
| E | 12495 | 0.3% | |
| Other values (9) | 67609 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 3299644 | 83.8% | |
| Space Separator | 423181 | 10.7% | |
| Uppercase Letter | 199559 | 5.1% | |
| Other Punctuation | 16897 | 0.4% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 100684 | 50.5% | |
| P | 26899 | 13.5% | |
| A | 14873 | 7.5% | |
| E | 12495 | 6.3% | |
| O | 12099 | 6.1% | |
| S | 11783 | 5.9% | |
| T | 7038 | 3.5% | |
| M | 6379 | 3.2% | |
| H | 4127 | 2.1% | |
| F | 3182 | 1.6% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| i | 414716 | 12.6% | |
| e | 410135 | 12.4% | |
| n | 359087 | 10.9% | |
| r | 299839 | 9.1% | |
| s | 260315 | 7.9% | |
| t | 217320 | 6.6% | |
| o | 209194 | 6.3% | |
| a | 201628 | 6.1% | |
| u | 161296 | 4.9% | |
| c | 145785 | 4.4% | |
| v | 134180 | 4.1% | |
| l | 119120 | 3.6% | |
| p | 91591 | 2.8% | |
| d | 83327 | 2.5% | |
| m | 57428 | 1.7% | |
| g | 37644 | 1.1% | |
| f | 30750 | 0.9% | |
| h | 26202 | 0.8% | |
| y | 17086 | 0.5% | |
| x | 12495 | 0.4% | |
| b | 6379 | 0.2% | |
| q | 4127 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 423181 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| & | 16897 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 3499203 | 88.8% | |
| Common | 440078 | 11.2% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| i | 414716 | 11.9% | |
| e | 410135 | 11.7% | |
| n | 359087 | 10.3% | |
| r | 299839 | 8.6% | |
| s | 260315 | 7.4% | |
| t | 217320 | 6.2% | |
| o | 209194 | 6.0% | |
| a | 201628 | 5.8% | |
| u | 161296 | 4.6% | |
| c | 145785 | 4.2% | |
| v | 134180 | 3.8% | |
| l | 119120 | 3.4% | |
| N | 100684 | 2.9% | |
| p | 91591 | 2.6% | |
| d | 83327 | 2.4% | |
| m | 57428 | 1.6% | |
| g | 37644 | 1.1% | |
| f | 30750 | 0.9% | |
| P | 26899 | 0.8% | |
| h | 26202 | 0.7% | |
| y | 17086 | 0.5% | |
| A | 14873 | 0.4% | |
| E | 12495 | 0.4% | |
| x | 12495 | 0.4% | |
| O | 12099 | 0.3% | |
| Other values (7) | 43015 | 1.2% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 423181 | 96.2% | ||
| & | 16897 | 3.8% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 3939281 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 423181 | 10.7% | ||
| i | 414716 | 10.5% | |
| e | 410135 | 10.4% | |
| n | 359087 | 9.1% | |
| r | 299839 | 7.6% | |
| s | 260315 | 6.6% | |
| t | 217320 | 5.5% | |
| o | 209194 | 5.3% | |
| a | 201628 | 5.1% | |
| u | 161296 | 4.1% | |
| c | 145785 | 3.7% | |
| v | 134180 | 3.4% | |
| l | 119120 | 3.0% | |
| N | 100684 | 2.6% | |
| p | 91591 | 2.3% | |
| d | 83327 | 2.1% | |
| m | 57428 | 1.5% | |
| g | 37644 | 1.0% | |
| f | 30750 | 0.8% | |
| P | 26899 | 0.7% | |
| h | 26202 | 0.7% | |
| y | 17086 | 0.4% | |
| & | 16897 | 0.4% | |
| A | 14873 | 0.4% | |
| E | 12495 | 0.3% | |
| Other values (9) | 67609 | 1.7% |
race
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| White | |
|---|---|
| Black | |
| Asian or Pacific Islander | 5835 |
| Other | 3657 |
| Amer Indian Aleut or Eskimo | 2251 |
| Value | Count | Frequency (%) | |
| White | 167365 | 83.9% | |
| Black | 20415 | 10.2% | |
| Asian or Pacific Islander | 5835 | 2.9% | |
| Other | 3657 | 1.8% | |
| Amer Indian Aleut or Eskimo | 2251 | 1.1% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 27 |
|---|---|
| Median length | 5 |
| Mean length | 5.833096936 |
| Min length | 5 |
Most occurring characters
| Value | Count | Frequency (%) | |
| i | 189372 | 16.3% | |
| e | 181359 | 15.6% | |
| t | 173273 | 14.9% | |
| h | 171022 | 14.7% | |
| W | 167365 | 14.4% | |
| a | 40171 | 3.5% | |
| c | 32085 | 2.8% | |
| l | 28501 | 2.4% | |
| 26509 | 2.3% | ||
| k | 22666 | 1.9% | |
| B | 20415 | 1.8% | |
| r | 19829 | 1.7% | |
| n | 16172 | 1.4% | |
| s | 13921 | 1.2% | |
| A | 10337 | 0.9% | |
| o | 10337 | 0.9% | |
| I | 8086 | 0.7% | |
| d | 8086 | 0.7% | |
| P | 5835 | 0.5% | |
| f | 5835 | 0.5% | |
| m | 4502 | 0.4% | |
| O | 3657 | 0.3% | |
| u | 2251 | 0.2% | |
| E | 2251 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 919382 | 79.0% | |
| Uppercase Letter | 217946 | 18.7% | |
| Space Separator | 26509 | 2.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| W | 167365 | 76.8% | |
| B | 20415 | 9.4% | |
| A | 10337 | 4.7% | |
| I | 8086 | 3.7% | |
| P | 5835 | 2.7% | |
| O | 3657 | 1.7% | |
| E | 2251 | 1.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| i | 189372 | 20.6% | |
| e | 181359 | 19.7% | |
| t | 173273 | 18.8% | |
| h | 171022 | 18.6% | |
| a | 40171 | 4.4% | |
| c | 32085 | 3.5% | |
| l | 28501 | 3.1% | |
| k | 22666 | 2.5% | |
| r | 19829 | 2.2% | |
| n | 16172 | 1.8% | |
| s | 13921 | 1.5% | |
| o | 10337 | 1.1% | |
| d | 8086 | 0.9% | |
| f | 5835 | 0.6% | |
| m | 4502 | 0.5% | |
| u | 2251 | 0.2% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 26509 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1137328 | 97.7% | |
| Common | 26509 | 2.3% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| i | 189372 | 16.7% | |
| e | 181359 | 15.9% | |
| t | 173273 | 15.2% | |
| h | 171022 | 15.0% | |
| W | 167365 | 14.7% | |
| a | 40171 | 3.5% | |
| c | 32085 | 2.8% | |
| l | 28501 | 2.5% | |
| k | 22666 | 2.0% | |
| B | 20415 | 1.8% | |
| r | 19829 | 1.7% | |
| n | 16172 | 1.4% | |
| s | 13921 | 1.2% | |
| A | 10337 | 0.9% | |
| o | 10337 | 0.9% | |
| I | 8086 | 0.7% | |
| d | 8086 | 0.7% | |
| P | 5835 | 0.5% | |
| f | 5835 | 0.5% | |
| m | 4502 | 0.4% | |
| O | 3657 | 0.3% | |
| u | 2251 | 0.2% | |
| E | 2251 | 0.2% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 26509 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1163837 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| i | 189372 | 16.3% | |
| e | 181359 | 15.6% | |
| t | 173273 | 14.9% | |
| h | 171022 | 14.7% | |
| W | 167365 | 14.4% | |
| a | 40171 | 3.5% | |
| c | 32085 | 2.8% | |
| l | 28501 | 2.4% | |
| 26509 | 2.3% | ||
| k | 22666 | 1.9% | |
| B | 20415 | 1.8% | |
| r | 19829 | 1.7% | |
| n | 16172 | 1.4% | |
| s | 13921 | 1.2% | |
| A | 10337 | 0.9% | |
| o | 10337 | 0.9% | |
| I | 8086 | 0.7% | |
| d | 8086 | 0.7% | |
| P | 5835 | 0.5% | |
| f | 5835 | 0.5% | |
| m | 4502 | 0.4% | |
| O | 3657 | 0.3% | |
| u | 2251 | 0.2% | |
| E | 2251 | 0.2% |
hispanic_origin
Categorical
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| All other | |
|---|---|
| Mexican-American | 8079 |
| Mexican (Mexicano) | 7234 |
| Central or South American | 3895 |
| Puerto Rican | 3313 |
| Other values (5) | 5095 |
| Value | Count | Frequency (%) | |
| All other | 171907 | 86.2% | |
| Mexican-American | 8079 | 4.0% | |
| Mexican (Mexicano) | 7234 | 3.6% | |
| Central or South American | 3895 | 2.0% | |
| Puerto Rican | 3313 | 1.7% | |
| Other Spanish | 2485 | 1.2% | |
| Cuban | 1126 | 0.6% | |
| NA | 874 | 0.4% | |
| Do not know | 306 | 0.2% | |
| Chicano | 304 | 0.2% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 25 |
|---|---|
| Median length | 9 |
| Mean length | 9.968509896 |
| Min length | 2 |
Most occurring characters
| Value | Count | Frequency (%) | |
| l | 347709 | 17.5% | |
| e | 216121 | 10.9% | |
| r | 197469 | 9.9% | |
| 197236 | 9.9% | ||
| o | 191466 | 9.6% | |
| t | 185801 | 9.3% | |
| A | 184755 | 9.3% | |
| h | 181076 | 9.1% | |
| n | 46256 | 2.3% | |
| a | 45644 | 2.3% | |
| i | 40623 | 2.0% | |
| c | 38138 | 1.9% | |
| M | 22547 | 1.1% | |
| x | 22547 | 1.1% | |
| m | 11974 | 0.6% | |
| u | 8334 | 0.4% | |
| - | 8079 | 0.4% | |
| ( | 7234 | 0.4% | |
| ) | 7234 | 0.4% | |
| S | 6380 | 0.3% | |
| C | 5325 | 0.3% | |
| P | 3313 | 0.2% | |
| R | 3313 | 0.2% | |
| O | 2485 | 0.1% | |
| p | 2485 | 0.1% | |
| Other values (6) | 5403 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1539866 | 77.4% | |
| Uppercase Letter | 229298 | 11.5% | |
| Space Separator | 197236 | 9.9% | |
| Dash Punctuation | 8079 | 0.4% | |
| Open Punctuation | 7234 | 0.4% | |
| Close Punctuation | 7234 | 0.4% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| A | 184755 | 80.6% | |
| M | 22547 | 9.8% | |
| S | 6380 | 2.8% | |
| C | 5325 | 2.3% | |
| P | 3313 | 1.4% | |
| R | 3313 | 1.4% | |
| O | 2485 | 1.1% | |
| N | 874 | 0.4% | |
| D | 306 | 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| l | 347709 | 22.6% | |
| e | 216121 | 14.0% | |
| r | 197469 | 12.8% | |
| o | 191466 | 12.4% | |
| t | 185801 | 12.1% | |
| h | 181076 | 11.8% | |
| n | 46256 | 3.0% | |
| a | 45644 | 3.0% | |
| i | 40623 | 2.6% | |
| c | 38138 | 2.5% | |
| x | 22547 | 1.5% | |
| m | 11974 | 0.8% | |
| u | 8334 | 0.5% | |
| p | 2485 | 0.2% | |
| s | 2485 | 0.2% | |
| b | 1126 | 0.1% | |
| k | 306 | < 0.1% | |
| w | 306 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 197236 | 100.0% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 7234 | 100.0% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 7234 | 100.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 8079 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1769164 | 88.9% | |
| Common | 219783 | 11.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| l | 347709 | 19.7% | |
| e | 216121 | 12.2% | |
| r | 197469 | 11.2% | |
| o | 191466 | 10.8% | |
| t | 185801 | 10.5% | |
| A | 184755 | 10.4% | |
| h | 181076 | 10.2% | |
| n | 46256 | 2.6% | |
| a | 45644 | 2.6% | |
| i | 40623 | 2.3% | |
| c | 38138 | 2.2% | |
| M | 22547 | 1.3% | |
| x | 22547 | 1.3% | |
| m | 11974 | 0.7% | |
| u | 8334 | 0.5% | |
| S | 6380 | 0.4% | |
| C | 5325 | 0.3% | |
| P | 3313 | 0.2% | |
| R | 3313 | 0.2% | |
| O | 2485 | 0.1% | |
| p | 2485 | 0.1% | |
| s | 2485 | 0.1% | |
| b | 1126 | 0.1% | |
| N | 874 | < 0.1% | |
| D | 306 | < 0.1% | |
| Other values (2) | 612 | < 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 197236 | 89.7% | ||
| - | 8079 | 3.7% | |
| ( | 7234 | 3.3% | |
| ) | 7234 | 3.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1988947 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| l | 347709 | 17.5% | |
| e | 216121 | 10.9% | |
| r | 197469 | 9.9% | |
| 197236 | 9.9% | ||
| o | 191466 | 9.6% | |
| t | 185801 | 9.3% | |
| A | 184755 | 9.3% | |
| h | 181076 | 9.1% | |
| n | 46256 | 2.3% | |
| a | 45644 | 2.3% | |
| i | 40623 | 2.0% | |
| c | 38138 | 1.9% | |
| M | 22547 | 1.1% | |
| x | 22547 | 1.1% | |
| m | 11974 | 0.6% | |
| u | 8334 | 0.4% | |
| - | 8079 | 0.4% | |
| ( | 7234 | 0.4% | |
| ) | 7234 | 0.4% | |
| S | 6380 | 0.3% | |
| C | 5325 | 0.3% | |
| P | 3313 | 0.2% | |
| R | 3313 | 0.2% | |
| O | 2485 | 0.1% | |
| p | 2485 | 0.1% | |
| Other values (6) | 5403 | 0.3% |
sex
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Female | |
|---|---|
| Male |
| Value | Count | Frequency (%) | |
| Female | 103984 | 52.1% | |
| Male | 95539 | 47.9% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.042325947 |
| Min length | 4 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 303507 | 30.2% | |
| a | 199523 | 19.8% | |
| l | 199523 | 19.8% | |
| F | 103984 | 10.3% | |
| m | 103984 | 10.3% | |
| M | 95539 | 9.5% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 806537 | 80.2% | |
| Uppercase Letter | 199523 | 19.8% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| F | 103984 | 52.1% | |
| M | 95539 | 47.9% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 303507 | 37.6% | |
| a | 199523 | 24.7% | |
| l | 199523 | 24.7% | |
| m | 103984 | 12.9% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1006060 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 303507 | 30.2% | |
| a | 199523 | 19.8% | |
| l | 199523 | 19.8% | |
| F | 103984 | 10.3% | |
| m | 103984 | 10.3% | |
| M | 95539 | 9.5% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1006060 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 303507 | 30.2% | |
| a | 199523 | 19.8% | |
| l | 199523 | 19.8% | |
| F | 103984 | 10.3% | |
| m | 103984 | 10.3% | |
| M | 95539 | 9.5% |
member_of_a_labor_union
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe | |
|---|---|
| No | 16034 |
| Yes | 3030 |
| Value | Count | Frequency (%) | |
| Not in universe | 180459 | 90.4% | |
| No | 16034 | 8.0% | |
| Yes | 3030 | 1.5% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 13.77306376 |
| Min length | 2 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 363948 | 13.2% | |
| 360918 | 13.1% | ||
| i | 360918 | 13.1% | |
| n | 360918 | 13.1% | |
| N | 196493 | 7.2% | |
| o | 196493 | 7.2% | |
| s | 183489 | 6.7% | |
| t | 180459 | 6.6% | |
| u | 180459 | 6.6% | |
| v | 180459 | 6.6% | |
| r | 180459 | 6.6% | |
| Y | 3030 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 2187602 | 79.6% | |
| Space Separator | 360918 | 13.1% | |
| Uppercase Letter | 199523 | 7.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 196493 | 98.5% | |
| Y | 3030 | 1.5% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 363948 | 16.6% | |
| i | 360918 | 16.5% | |
| n | 360918 | 16.5% | |
| o | 196493 | 9.0% | |
| s | 183489 | 8.4% | |
| t | 180459 | 8.2% | |
| u | 180459 | 8.2% | |
| v | 180459 | 8.2% | |
| r | 180459 | 8.2% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 360918 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 2387125 | 86.9% | |
| Common | 360918 | 13.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 363948 | 15.2% | |
| i | 360918 | 15.1% | |
| n | 360918 | 15.1% | |
| N | 196493 | 8.2% | |
| o | 196493 | 8.2% | |
| s | 183489 | 7.7% | |
| t | 180459 | 7.6% | |
| u | 180459 | 7.6% | |
| v | 180459 | 7.6% | |
| r | 180459 | 7.6% | |
| Y | 3030 | 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 360918 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 2748043 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 363948 | 13.2% | |
| 360918 | 13.1% | ||
| i | 360918 | 13.1% | |
| n | 360918 | 13.1% | |
| N | 196493 | 7.2% | |
| o | 196493 | 7.2% | |
| s | 183489 | 6.7% | |
| t | 180459 | 6.6% | |
| u | 180459 | 6.6% | |
| v | 180459 | 6.6% | |
| r | 180459 | 6.6% | |
| Y | 3030 | 0.1% |
reason_for_unemployment
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe | |
|---|---|
| Other job loser | 2038 |
| Re-entrant | 2019 |
| Job loser - on layoff | 976 |
| Job leaver | 598 |
| Value | Count | Frequency (%) | |
| Not in universe | 193453 | 97.0% | |
| Other job loser | 2038 | 1.0% | |
| Re-entrant | 2019 | 1.0% | |
| Job loser - on layoff | 976 | 0.5% | |
| Job leaver | 598 | 0.3% | |
| New entrant | 439 | 0.2% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 21 |
|---|---|
| Median length | 15 |
| Mean length | 14.9549676 |
| Min length | 10 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 398070 | 13.3% | |
| 395923 | 13.3% | ||
| n | 392798 | 13.2% | |
| i | 386906 | 13.0% | |
| o | 202031 | 6.8% | |
| r | 201561 | 6.8% | |
| t | 200407 | 6.7% | |
| s | 196467 | 6.6% | |
| v | 194051 | 6.5% | |
| N | 193892 | 6.5% | |
| u | 193453 | 6.5% | |
| l | 4588 | 0.2% | |
| a | 4032 | 0.1% | |
| b | 3612 | 0.1% | |
| - | 2995 | 0.1% | |
| O | 2038 | 0.1% | |
| h | 2038 | 0.1% | |
| j | 2038 | 0.1% | |
| R | 2019 | 0.1% | |
| f | 1952 | 0.1% | |
| J | 1574 | 0.1% | |
| y | 976 | < 0.1% | |
| w | 439 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 2385419 | 79.9% | |
| Space Separator | 395923 | 13.3% | |
| Uppercase Letter | 199523 | 6.7% | |
| Dash Punctuation | 2995 | 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 193892 | 97.2% | |
| O | 2038 | 1.0% | |
| R | 2019 | 1.0% | |
| J | 1574 | 0.8% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 398070 | 16.7% | |
| n | 392798 | 16.5% | |
| i | 386906 | 16.2% | |
| o | 202031 | 8.5% | |
| r | 201561 | 8.4% | |
| t | 200407 | 8.4% | |
| s | 196467 | 8.2% | |
| v | 194051 | 8.1% | |
| u | 193453 | 8.1% | |
| l | 4588 | 0.2% | |
| a | 4032 | 0.2% | |
| b | 3612 | 0.2% | |
| h | 2038 | 0.1% | |
| j | 2038 | 0.1% | |
| f | 1952 | 0.1% | |
| y | 976 | < 0.1% | |
| w | 439 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 395923 | 100.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 2995 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 2584942 | 86.6% | |
| Common | 398918 | 13.4% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 398070 | 15.4% | |
| n | 392798 | 15.2% | |
| i | 386906 | 15.0% | |
| o | 202031 | 7.8% | |
| r | 201561 | 7.8% | |
| t | 200407 | 7.8% | |
| s | 196467 | 7.6% | |
| v | 194051 | 7.5% | |
| N | 193892 | 7.5% | |
| u | 193453 | 7.5% | |
| l | 4588 | 0.2% | |
| a | 4032 | 0.2% | |
| b | 3612 | 0.1% | |
| O | 2038 | 0.1% | |
| h | 2038 | 0.1% | |
| j | 2038 | 0.1% | |
| R | 2019 | 0.1% | |
| f | 1952 | 0.1% | |
| J | 1574 | 0.1% | |
| y | 976 | < 0.1% | |
| w | 439 | < 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 395923 | 99.2% | ||
| - | 2995 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 2983860 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 398070 | 13.3% | |
| 395923 | 13.3% | ||
| n | 392798 | 13.2% | |
| i | 386906 | 13.0% | |
| o | 202031 | 6.8% | |
| r | 201561 | 6.8% | |
| t | 200407 | 6.7% | |
| s | 196467 | 6.6% | |
| v | 194051 | 6.5% | |
| N | 193892 | 6.5% | |
| u | 193453 | 6.5% | |
| l | 4588 | 0.2% | |
| a | 4032 | 0.1% | |
| b | 3612 | 0.1% | |
| - | 2995 | 0.1% | |
| O | 2038 | 0.1% | |
| h | 2038 | 0.1% | |
| j | 2038 | 0.1% | |
| R | 2019 | 0.1% | |
| f | 1952 | 0.1% | |
| J | 1574 | 0.1% | |
| y | 976 | < 0.1% | |
| w | 439 | < 0.1% |
full_or_part_time_employment_stat
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Children or Armed Forces | |
|---|---|
| Full-time schedules | |
| Not in labor force | |
| PT for non-econ reasons usually FT | 3322 |
| Unemployed full-time | 2311 |
| Other values (3) | 2577 |
| Value | Count | Frequency (%) | |
| Children or Armed Forces | 123769 | 62.0% | |
| Full-time schedules | 40736 | 20.4% | |
| Not in labor force | 26808 | 13.4% | |
| PT for non-econ reasons usually FT | 3322 | 1.7% | |
| Unemployed full-time | 2311 | 1.2% | |
| PT for econ reasons usually PT | 1209 | 0.6% | |
| Unemployed part- time | 843 | 0.4% | |
| PT for econ reasons usually FT | 525 | 0.3% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 34 |
|---|---|
| Median length | 24 |
| Mean length | 22.33263834 |
| Min length | 18 |
Most occurring characters
| Value | Count | Frequency (%) | |
| r | 559647 | 12.6% | |
| e | 539897 | 12.1% | |
| 521744 | 11.7% | ||
| o | 349606 | 7.8% | |
| d | 291428 | 6.5% | |
| l | 290673 | 6.5% | |
| s | 220409 | 4.9% | |
| c | 196369 | 4.4% | |
| i | 194467 | 4.4% | |
| m | 170813 | 3.8% | |
| n | 170487 | 3.8% | |
| F | 168352 | 3.8% | |
| h | 164505 | 3.7% | |
| C | 123769 | 2.8% | |
| A | 123769 | 2.8% | |
| u | 93895 | 2.1% | |
| t | 71541 | 1.6% | |
| - | 47212 | 1.1% | |
| a | 37763 | 0.8% | |
| f | 34175 | 0.8% | |
| N | 26808 | 0.6% | |
| b | 26808 | 0.6% | |
| T | 10112 | 0.2% | |
| y | 8210 | 0.2% | |
| P | 6265 | 0.1% | |
| Other values (2) | 7151 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 3424690 | 76.9% | |
| Space Separator | 521744 | 11.7% | |
| Uppercase Letter | 462229 | 10.4% | |
| Dash Punctuation | 47212 | 1.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| F | 168352 | 36.4% | |
| C | 123769 | 26.8% | |
| A | 123769 | 26.8% | |
| N | 26808 | 5.8% | |
| T | 10112 | 2.2% | |
| P | 6265 | 1.4% | |
| U | 3154 | 0.7% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| r | 559647 | 16.3% | |
| e | 539897 | 15.8% | |
| o | 349606 | 10.2% | |
| d | 291428 | 8.5% | |
| l | 290673 | 8.5% | |
| s | 220409 | 6.4% | |
| c | 196369 | 5.7% | |
| i | 194467 | 5.7% | |
| m | 170813 | 5.0% | |
| n | 170487 | 5.0% | |
| h | 164505 | 4.8% | |
| u | 93895 | 2.7% | |
| t | 71541 | 2.1% | |
| a | 37763 | 1.1% | |
| f | 34175 | 1.0% | |
| b | 26808 | 0.8% | |
| y | 8210 | 0.2% | |
| p | 3997 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 521744 | 100.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 47212 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 3886919 | 87.2% | |
| Common | 568956 | 12.8% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| r | 559647 | 14.4% | |
| e | 539897 | 13.9% | |
| o | 349606 | 9.0% | |
| d | 291428 | 7.5% | |
| l | 290673 | 7.5% | |
| s | 220409 | 5.7% | |
| c | 196369 | 5.1% | |
| i | 194467 | 5.0% | |
| m | 170813 | 4.4% | |
| n | 170487 | 4.4% | |
| F | 168352 | 4.3% | |
| h | 164505 | 4.2% | |
| C | 123769 | 3.2% | |
| A | 123769 | 3.2% | |
| u | 93895 | 2.4% | |
| t | 71541 | 1.8% | |
| a | 37763 | 1.0% | |
| f | 34175 | 0.9% | |
| N | 26808 | 0.7% | |
| b | 26808 | 0.7% | |
| T | 10112 | 0.3% | |
| y | 8210 | 0.2% | |
| P | 6265 | 0.2% | |
| p | 3997 | 0.1% | |
| U | 3154 | 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 521744 | 91.7% | ||
| - | 47212 | 8.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 4455875 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| r | 559647 | 12.6% | |
| e | 539897 | 12.1% | |
| 521744 | 11.7% | ||
| o | 349606 | 7.8% | |
| d | 291428 | 6.5% | |
| l | 290673 | 6.5% | |
| s | 220409 | 4.9% | |
| c | 196369 | 4.4% | |
| i | 194467 | 4.4% | |
| m | 170813 | 3.8% | |
| n | 170487 | 3.8% | |
| F | 168352 | 3.8% | |
| h | 164505 | 3.7% | |
| C | 123769 | 2.8% | |
| A | 123769 | 2.8% | |
| u | 93895 | 2.1% | |
| t | 71541 | 1.6% | |
| - | 47212 | 1.1% | |
| a | 37763 | 0.8% | |
| f | 34175 | 0.8% | |
| N | 26808 | 0.6% | |
| b | 26808 | 0.6% | |
| T | 10112 | 0.2% | |
| y | 8210 | 0.2% | |
| P | 6265 | 0.1% | |
| Other values (2) | 7151 | 0.2% |
| Distinct | 132 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 434.7189898 |
|---|---|
| Minimum | 0 |
| Maximum | 99999 |
| Zeros | 192144 |
| Zeros (%) | 96.3% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 4697.53128 |
|---|---|
| Coefficient of variation (CV) | 10.8059031 |
| Kurtosis | 393.0628325 |
| Mean | 434.7189898 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 18.99082234 |
| Sum | 86736437 |
| Variance | 22066800.12 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 192144 | 96.3% | |
| 15024 | 788 | 0.4% | |
| 7688 | 609 | 0.3% | |
| 7298 | 582 | 0.3% | |
| 99999 | 390 | 0.2% | |
| 3103 | 237 | 0.1% | |
| 5178 | 207 | 0.1% | |
| 5013 | 158 | 0.1% | |
| 4386 | 151 | 0.1% | |
| 3325 | 121 | 0.1% | |
| 8614 | 118 | 0.1% | |
| 10520 | 98 | < 0.1% | |
| 27828 | 94 | < 0.1% | |
| 4650 | 93 | < 0.1% | |
| 20051 | 91 | < 0.1% | |
| 594 | 88 | < 0.1% | |
| 4064 | 83 | < 0.1% | |
| 2174 | 83 | < 0.1% | |
| 1086 | 81 | < 0.1% | |
| 14084 | 77 | < 0.1% | |
| 1409 | 77 | < 0.1% | |
| 13550 | 74 | < 0.1% | |
| 2829 | 71 | < 0.1% | |
| 10605 | 70 | < 0.1% | |
| 9386 | 70 | < 0.1% | |
| Other values (107) | 2868 | 1.4% |
| Value | Count | Frequency (%) | |
| 0 | 192144 | 96.3% | |
| 114 | 11 | < 0.1% | |
| 401 | 33 | < 0.1% | |
| 594 | 88 | < 0.1% | |
| 914 | 17 | < 0.1% | |
| 991 | 59 | < 0.1% | |
| 1055 | 69 | < 0.1% | |
| 1086 | 81 | < 0.1% | |
| 1090 | 2 | < 0.1% | |
| 1111 | 4 | < 0.1% |
| Value | Count | Frequency (%) | |
| 99999 | 390 | 0.2% | |
| 41310 | 2 | < 0.1% | |
| 34095 | 11 | < 0.1% | |
| 27828 | 94 | < 0.1% | |
| 25236 | 23 | < 0.1% | |
| 25124 | 18 | < 0.1% | |
| 22040 | 2 | < 0.1% | |
| 20051 | 91 | < 0.1% | |
| 18481 | 14 | < 0.1% | |
| 15831 | 16 | < 0.1% |
| Distinct | 113 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.31378839 |
|---|---|
| Minimum | 0 |
| Maximum | 4608 |
| Zeros | 195617 |
| Zeros (%) | 98.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 4608 |
| Range | 4608 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 271.8964284 |
|---|---|
| Coefficient of variation (CV) | 7.286754847 |
| Kurtosis | 61.63293305 |
| Mean | 37.31378839 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.6325647 |
| Sum | 7444959 |
| Variance | 73927.66776 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 195617 | 98.0% | |
| 1902 | 407 | 0.2% | |
| 1977 | 381 | 0.2% | |
| 1887 | 364 | 0.2% | |
| 1602 | 193 | 0.1% | |
| 2415 | 122 | 0.1% | |
| 1485 | 95 | < 0.1% | |
| 1848 | 88 | < 0.1% | |
| 1876 | 87 | < 0.1% | |
| 1672 | 85 | < 0.1% | |
| 1590 | 84 | < 0.1% | |
| 1740 | 72 | < 0.1% | |
| 2339 | 61 | < 0.1% | |
| 1564 | 60 | < 0.1% | |
| 1980 | 59 | < 0.1% | |
| 1741 | 58 | < 0.1% | |
| 1408 | 56 | < 0.1% | |
| 1719 | 56 | < 0.1% | |
| 2001 | 56 | < 0.1% | |
| 2258 | 56 | < 0.1% | |
| 1669 | 55 | < 0.1% | |
| 1974 | 51 | < 0.1% | |
| 2002 | 51 | < 0.1% | |
| 2377 | 48 | < 0.1% | |
| 2205 | 46 | < 0.1% | |
| Other values (88) | 1215 | 0.6% |
| Value | Count | Frequency (%) | |
| 0 | 195617 | 98.0% | |
| 155 | 1 | < 0.1% | |
| 213 | 10 | < 0.1% | |
| 323 | 10 | < 0.1% | |
| 419 | 29 | < 0.1% | |
| 625 | 25 | < 0.1% | |
| 653 | 7 | < 0.1% | |
| 772 | 5 | < 0.1% | |
| 810 | 5 | < 0.1% | |
| 880 | 9 | < 0.1% |
| Value | Count | Frequency (%) | |
| 4608 | 4 | < 0.1% | |
| 4356 | 30 | < 0.1% | |
| 3900 | 2 | < 0.1% | |
| 3770 | 5 | < 0.1% | |
| 3683 | 4 | < 0.1% | |
| 3500 | 10 | < 0.1% | |
| 3175 | 8 | < 0.1% | |
| 3004 | 11 | < 0.1% | |
| 2824 | 27 | < 0.1% | |
| 2788 | 7 | < 0.1% |
| Distinct | 1478 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 197.5295329 |
|---|---|
| Minimum | 0 |
| Maximum | 99999 |
| Zeros | 178382 |
| Zeros (%) | 89.4% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 400 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1984.163658 |
|---|---|
| Coefficient of variation (CV) | 10.04489622 |
| Kurtosis | 1090.563754 |
| Mean | 197.5295329 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 27.78650179 |
| Sum | 39411685 |
| Variance | 3936905.423 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 178382 | 89.4% | |
| 100 | 1148 | 0.6% | |
| 500 | 1030 | 0.5% | |
| 1000 | 894 | 0.4% | |
| 200 | 866 | 0.4% | |
| 50 | 832 | 0.4% | |
| 2000 | 574 | 0.3% | |
| 250 | 555 | 0.3% | |
| 150 | 549 | 0.3% | |
| 300 | 523 | 0.3% | |
| 1 | 472 | 0.2% | |
| 400 | 409 | 0.2% | |
| 1500 | 380 | 0.2% | |
| 2500 | 372 | 0.2% | |
| 25 | 360 | 0.2% | |
| 5000 | 304 | 0.2% | |
| 3000 | 292 | 0.1% | |
| 600 | 287 | 0.1% | |
| 10 | 253 | 0.1% | |
| 4000 | 222 | 0.1% | |
| 20 | 213 | 0.1% | |
| 2 | 193 | 0.1% | |
| 10000 | 182 | 0.1% | |
| 5 | 179 | 0.1% | |
| 125 | 175 | 0.1% | |
| Other values (1453) | 9877 | 5.0% |
| Value | Count | Frequency (%) | |
| 0 | 178382 | 89.4% | |
| 1 | 472 | 0.2% | |
| 2 | 193 | 0.1% | |
| 3 | 129 | 0.1% | |
| 4 | 75 | < 0.1% | |
| 5 | 179 | 0.1% | |
| 6 | 100 | 0.1% | |
| 7 | 93 | < 0.1% | |
| 8 | 94 | < 0.1% | |
| 9 | 56 | < 0.1% |
| Value | Count | Frequency (%) | |
| 99999 | 25 | < 0.1% | |
| 95095 | 1 | < 0.1% | |
| 75000 | 5 | < 0.1% | |
| 70000 | 3 | < 0.1% | |
| 66621 | 2 | < 0.1% | |
| 60000 | 7 | < 0.1% | |
| 57678 | 1 | < 0.1% | |
| 55000 | 1 | < 0.1% | |
| 54600 | 2 | < 0.1% | |
| 54500 | 2 | < 0.1% |
tax_filer_stat
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Nonfiler | |
|---|---|
| Joint both under 65 | |
| Single | |
| Joint both 65+ | |
| Head of household | 7426 |
| Value | Count | Frequency (%) | |
| Nonfiler | 75094 | 37.6% | |
| Joint both under 65 | 67383 | 33.8% | |
| Single | 37421 | 18.8% | |
| Joint both 65+ | 8332 | 4.2% | |
| Head of household | 7426 | 3.7% | |
| Joint one under 65 & one 65+ | 3867 | 1.9% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 28 |
|---|---|
| Median length | 8 |
| Mean length | 12.31297144 |
| Min length | 6 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 271081 | 11.0% | |
| o | 260403 | 10.6% | |
| 256867 | 10.5% | ||
| e | 206351 | 8.4% | |
| i | 192097 | 7.8% | |
| t | 155297 | 6.3% | |
| r | 146344 | 6.0% | |
| l | 119941 | 4.9% | |
| h | 90567 | 3.7% | |
| d | 86102 | 3.5% | |
| 6 | 83449 | 3.4% | |
| 5 | 83449 | 3.4% | |
| f | 82520 | 3.4% | |
| J | 79582 | 3.2% | |
| u | 78676 | 3.2% | |
| b | 75715 | 3.1% | |
| N | 75094 | 3.1% | |
| S | 37421 | 1.5% | |
| g | 37421 | 1.5% | |
| + | 12199 | 0.5% | |
| H | 7426 | 0.3% | |
| a | 7426 | 0.3% | |
| s | 7426 | 0.3% | |
| & | 3867 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1817367 | 74.0% | |
| Space Separator | 256867 | 10.5% | |
| Uppercase Letter | 199523 | 8.1% | |
| Decimal Number | 166898 | 6.8% | |
| Math Symbol | 12199 | 0.5% | |
| Other Punctuation | 3867 | 0.2% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| J | 79582 | 39.9% | |
| N | 75094 | 37.6% | |
| S | 37421 | 18.8% | |
| H | 7426 | 3.7% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 271081 | 14.9% | |
| o | 260403 | 14.3% | |
| e | 206351 | 11.4% | |
| i | 192097 | 10.6% | |
| t | 155297 | 8.5% | |
| r | 146344 | 8.1% | |
| l | 119941 | 6.6% | |
| h | 90567 | 5.0% | |
| d | 86102 | 4.7% | |
| f | 82520 | 4.5% | |
| u | 78676 | 4.3% | |
| b | 75715 | 4.2% | |
| g | 37421 | 2.1% | |
| a | 7426 | 0.4% | |
| s | 7426 | 0.4% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 256867 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 6 | 83449 | 50.0% | |
| 5 | 83449 | 50.0% |
Most frequent Math Symbol characters
| Value | Count | Frequency (%) | |
| + | 12199 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| & | 3867 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 2016890 | 82.1% | |
| Common | 439831 | 17.9% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 271081 | 13.4% | |
| o | 260403 | 12.9% | |
| e | 206351 | 10.2% | |
| i | 192097 | 9.5% | |
| t | 155297 | 7.7% | |
| r | 146344 | 7.3% | |
| l | 119941 | 5.9% | |
| h | 90567 | 4.5% | |
| d | 86102 | 4.3% | |
| f | 82520 | 4.1% | |
| J | 79582 | 3.9% | |
| u | 78676 | 3.9% | |
| b | 75715 | 3.8% | |
| N | 75094 | 3.7% | |
| S | 37421 | 1.9% | |
| g | 37421 | 1.9% | |
| H | 7426 | 0.4% | |
| a | 7426 | 0.4% | |
| s | 7426 | 0.4% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 256867 | 58.4% | ||
| 6 | 83449 | 19.0% | |
| 5 | 83449 | 19.0% | |
| + | 12199 | 2.8% | |
| & | 3867 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 2456721 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 271081 | 11.0% | |
| o | 260403 | 10.6% | |
| 256867 | 10.5% | ||
| e | 206351 | 8.4% | |
| i | 192097 | 7.8% | |
| t | 155297 | 6.3% | |
| r | 146344 | 6.0% | |
| l | 119941 | 4.9% | |
| h | 90567 | 3.7% | |
| d | 86102 | 3.5% | |
| 6 | 83449 | 3.4% | |
| 5 | 83449 | 3.4% | |
| f | 82520 | 3.4% | |
| J | 79582 | 3.2% | |
| u | 78676 | 3.2% | |
| b | 75715 | 3.1% | |
| N | 75094 | 3.1% | |
| S | 37421 | 1.5% | |
| g | 37421 | 1.5% | |
| + | 12199 | 0.5% | |
| H | 7426 | 0.3% | |
| a | 7426 | 0.3% | |
| s | 7426 | 0.3% | |
| & | 3867 | 0.2% |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe | |
|---|---|
| South | 4889 |
| West | 4074 |
| Midwest | 3575 |
| Northeast | 2705 |
| Value | Count | Frequency (%) | |
| Not in universe | 183750 | 92.1% | |
| South | 4889 | 2.5% | |
| West | 4074 | 2.0% | |
| Midwest | 3575 | 1.8% | |
| Northeast | 2705 | 1.4% | |
| Abroad | 530 | 0.3% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 14.28176701 |
| Min length | 4 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 377854 | 13.3% | |
| i | 371075 | 13.0% | |
| 367500 | 12.9% | ||
| n | 367500 | 12.9% | |
| t | 201698 | 7.1% | |
| s | 194104 | 6.8% | |
| o | 191874 | 6.7% | |
| u | 188639 | 6.6% | |
| r | 186985 | 6.6% | |
| N | 186455 | 6.5% | |
| v | 183750 | 6.4% | |
| h | 7594 | 0.3% | |
| S | 4889 | 0.2% | |
| d | 4105 | 0.1% | |
| W | 4074 | 0.1% | |
| M | 3575 | 0.1% | |
| w | 3575 | 0.1% | |
| a | 3235 | 0.1% | |
| A | 530 | < 0.1% | |
| b | 530 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 2282518 | 80.1% | |
| Space Separator | 367500 | 12.9% | |
| Uppercase Letter | 199523 | 7.0% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 186455 | 93.5% | |
| S | 4889 | 2.5% | |
| W | 4074 | 2.0% | |
| M | 3575 | 1.8% | |
| A | 530 | 0.3% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 377854 | 16.6% | |
| i | 371075 | 16.3% | |
| n | 367500 | 16.1% | |
| t | 201698 | 8.8% | |
| s | 194104 | 8.5% | |
| o | 191874 | 8.4% | |
| u | 188639 | 8.3% | |
| r | 186985 | 8.2% | |
| v | 183750 | 8.1% | |
| h | 7594 | 0.3% | |
| d | 4105 | 0.2% | |
| w | 3575 | 0.2% | |
| a | 3235 | 0.1% | |
| b | 530 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 367500 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 2482041 | 87.1% | |
| Common | 367500 | 12.9% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 377854 | 15.2% | |
| i | 371075 | 15.0% | |
| n | 367500 | 14.8% | |
| t | 201698 | 8.1% | |
| s | 194104 | 7.8% | |
| o | 191874 | 7.7% | |
| u | 188639 | 7.6% | |
| r | 186985 | 7.5% | |
| N | 186455 | 7.5% | |
| v | 183750 | 7.4% | |
| h | 7594 | 0.3% | |
| S | 4889 | 0.2% | |
| d | 4105 | 0.2% | |
| W | 4074 | 0.2% | |
| M | 3575 | 0.1% | |
| w | 3575 | 0.1% | |
| a | 3235 | 0.1% | |
| A | 530 | < 0.1% | |
| b | 530 | < 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 367500 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 2849541 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 377854 | 13.3% | |
| i | 371075 | 13.0% | |
| 367500 | 12.9% | ||
| n | 367500 | 12.9% | |
| t | 201698 | 7.1% | |
| s | 194104 | 6.8% | |
| o | 191874 | 6.7% | |
| u | 188639 | 6.6% | |
| r | 186985 | 6.6% | |
| N | 186455 | 6.5% | |
| v | 183750 | 6.4% | |
| h | 7594 | 0.3% | |
| S | 4889 | 0.2% | |
| d | 4105 | 0.1% | |
| W | 4074 | 0.1% | |
| M | 3575 | 0.1% | |
| w | 3575 | 0.1% | |
| a | 3235 | 0.1% | |
| A | 530 | < 0.1% | |
| b | 530 | < 0.1% |
| Distinct | 51 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe | |
|---|---|
| California | 1714 |
| Utah | 1063 |
| Florida | 849 |
| North Carolina | 812 |
| Other values (46) | 11335 |
| Value | Count | Frequency (%) | |
| Not in universe | 183750 | 92.1% | |
| California | 1714 | 0.9% | |
| Utah | 1063 | 0.5% | |
| Florida | 849 | 0.4% | |
| North Carolina | 812 | 0.4% | |
| ? | 708 | 0.4% | |
| Abroad | 671 | 0.3% | |
| Oklahoma | 626 | 0.3% | |
| Minnesota | 576 | 0.3% | |
| Indiana | 533 | 0.3% | |
| North Dakota | 499 | 0.3% | |
| New Mexico | 463 | 0.2% | |
| Michigan | 441 | 0.2% | |
| Alaska | 290 | 0.1% | |
| Kentucky | 244 | 0.1% | |
| Arizona | 243 | 0.1% | |
| New Hampshire | 242 | 0.1% | |
| Wyoming | 241 | 0.1% | |
| Colorado | 239 | 0.1% | |
| Oregon | 236 | 0.1% | |
| West Virginia | 231 | 0.1% | |
| Georgia | 227 | 0.1% | |
| Montana | 226 | 0.1% | |
| Alabama | 216 | 0.1% | |
| Ohio | 211 | 0.1% | |
| Other values (26) | 3972 | 2.0% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 20 |
|---|---|
| Median length | 15 |
| Mean length | 14.45687465 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| i | 380324 | 13.2% | |
| n | 377218 | 13.1% | |
| e | 373184 | 12.9% | |
| 370482 | 12.8% | ||
| o | 195445 | 6.8% | |
| r | 192090 | 6.7% | |
| s | 189330 | 6.6% | |
| t | 189230 | 6.6% | |
| N | 186388 | 6.5% | |
| u | 184978 | 6.4% | |
| v | 184123 | 6.4% | |
| a | 19048 | 0.7% | |
| l | 5725 | 0.2% | |
| h | 4309 | 0.1% | |
| C | 3093 | 0.1% | |
| d | 2633 | 0.1% | |
| M | 2539 | 0.1% | |
| k | 2375 | 0.1% | |
| f | 1830 | 0.1% | |
| c | 1754 | 0.1% | |
| m | 1632 | 0.1% | |
| A | 1625 | 0.1% | |
| g | 1502 | 0.1% | |
| w | 1237 | < 0.1% | |
| b | 1181 | < 0.1% | |
| Other values (21) | 11204 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 2311608 | 80.1% | |
| Space Separator | 370482 | 12.8% | |
| Uppercase Letter | 201681 | 7.0% | |
| Other Punctuation | 708 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 186388 | 92.4% | |
| C | 3093 | 1.5% | |
| M | 2539 | 1.3% | |
| A | 1625 | 0.8% | |
| O | 1073 | 0.5% | |
| U | 1063 | 0.5% | |
| I | 933 | 0.5% | |
| F | 849 | 0.4% | |
| D | 826 | 0.4% | |
| W | 577 | 0.3% | |
| V | 548 | 0.3% | |
| T | 411 | 0.2% | |
| K | 393 | 0.2% | |
| H | 242 | 0.1% | |
| S | 233 | 0.1% | |
| G | 227 | 0.1% | |
| P | 199 | 0.1% | |
| Y | 195 | 0.1% | |
| L | 192 | 0.1% | |
| J | 75 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| i | 380324 | 16.5% | |
| n | 377218 | 16.3% | |
| e | 373184 | 16.1% | |
| o | 195445 | 8.5% | |
| r | 192090 | 8.3% | |
| s | 189330 | 8.2% | |
| t | 189230 | 8.2% | |
| u | 184978 | 8.0% | |
| v | 184123 | 8.0% | |
| a | 19048 | 0.8% | |
| l | 5725 | 0.2% | |
| h | 4309 | 0.2% | |
| d | 2633 | 0.1% | |
| k | 2375 | 0.1% | |
| f | 1830 | 0.1% | |
| c | 1754 | 0.1% | |
| m | 1632 | 0.1% | |
| g | 1502 | 0.1% | |
| w | 1237 | 0.1% | |
| b | 1181 | 0.1% | |
| y | 895 | < 0.1% | |
| x | 672 | < 0.1% | |
| p | 650 | < 0.1% | |
| z | 243 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 370482 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| ? | 708 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 2513289 | 87.1% | |
| Common | 371190 | 12.9% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| i | 380324 | 15.1% | |
| n | 377218 | 15.0% | |
| e | 373184 | 14.8% | |
| o | 195445 | 7.8% | |
| r | 192090 | 7.6% | |
| s | 189330 | 7.5% | |
| t | 189230 | 7.5% | |
| N | 186388 | 7.4% | |
| u | 184978 | 7.4% | |
| v | 184123 | 7.3% | |
| a | 19048 | 0.8% | |
| l | 5725 | 0.2% | |
| h | 4309 | 0.2% | |
| C | 3093 | 0.1% | |
| d | 2633 | 0.1% | |
| M | 2539 | 0.1% | |
| k | 2375 | 0.1% | |
| f | 1830 | 0.1% | |
| c | 1754 | 0.1% | |
| m | 1632 | 0.1% | |
| A | 1625 | 0.1% | |
| g | 1502 | 0.1% | |
| w | 1237 | < 0.1% | |
| b | 1181 | < 0.1% | |
| O | 1073 | < 0.1% | |
| Other values (19) | 9423 | 0.4% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 370482 | 99.8% | ||
| ? | 708 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 2884479 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| i | 380324 | 13.2% | |
| n | 377218 | 13.1% | |
| e | 373184 | 12.9% | |
| 370482 | 12.8% | ||
| o | 195445 | 6.8% | |
| r | 192090 | 6.7% | |
| s | 189330 | 6.6% | |
| t | 189230 | 6.6% | |
| N | 186388 | 6.5% | |
| u | 184978 | 6.4% | |
| v | 184123 | 6.4% | |
| a | 19048 | 0.7% | |
| l | 5725 | 0.2% | |
| h | 4309 | 0.1% | |
| C | 3093 | 0.1% | |
| d | 2633 | 0.1% | |
| M | 2539 | 0.1% | |
| k | 2375 | 0.1% | |
| f | 1830 | 0.1% | |
| c | 1754 | 0.1% | |
| m | 1632 | 0.1% | |
| A | 1625 | 0.1% | |
| g | 1502 | 0.1% | |
| w | 1237 | < 0.1% | |
| b | 1181 | < 0.1% | |
| Other values (21) | 11204 | 0.4% |
| Distinct | 38 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Householder | |
|---|---|
| Child <18 never marr not in subfamily | |
| Spouse of householder | |
| Nonfamily householder | |
| Child 18+ never marr Not in a subfamily | |
| Other values (33) |
| Value | Count | Frequency (%) | |
| Householder | 53248 | 26.7% | |
| Child <18 never marr not in subfamily | 50326 | 25.2% | |
| Spouse of householder | 41695 | 20.9% | |
| Nonfamily householder | 22213 | 11.1% | |
| Child 18+ never marr Not in a subfamily | 12030 | 6.0% | |
| Secondary individual | 6122 | 3.1% | |
| Other Rel 18+ ever marr not in subfamily | 1956 | 1.0% | |
| Grandchild <18 never marr child of subfamily RP | 1868 | 0.9% | |
| Other Rel 18+ never marr not in subfamily | 1728 | 0.9% | |
| Grandchild <18 never marr not in subfamily | 1066 | 0.5% | |
| Child 18+ ever marr Not in a subfamily | 1013 | 0.5% | |
| Child under 18 of RP of unrel subfamily | 732 | 0.4% | |
| RP of unrelated subfamily | 685 | 0.3% | |
| Child 18+ ever marr RP of subfamily | 671 | 0.3% | |
| Other Rel 18+ ever marr RP of subfamily | 656 | 0.3% | |
| Other Rel <18 never marr child of subfamily RP | 656 | 0.3% | |
| Other Rel 18+ spouse of subfamily RP | 638 | 0.3% | |
| Child 18+ never marr RP of subfamily | 589 | 0.3% | |
| Other Rel <18 never marr not in subfamily | 584 | 0.3% | |
| Grandchild 18+ never marr not in subfamily | 375 | 0.2% | |
| In group quarters | 196 | 0.1% | |
| Child 18+ spouse of subfamily RP | 126 | 0.1% | |
| Other Rel 18+ never marr RP of subfamily | 94 | < 0.1% | |
| Child <18 never marr RP of subfamily | 80 | < 0.1% | |
| Spouse of RP of unrelated subfamily | 52 | < 0.1% | |
| Other values (13) | 124 | 0.1% |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Length
| Max length | 47 |
|---|---|
| Median length | 21 |
| Mean length | 24.71388762 |
| Min length | 11 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 588150 | 11.9% | ||
| e | 446352 | 9.1% | |
| o | 423897 | 8.6% | |
| r | 357168 | 7.2% | |
| l | 300845 | 6.1% | |
| h | 258900 | 5.3% | |
| i | 257293 | 5.2% | |
| u | 244446 | 5.0% | |
| s | 236706 | 4.8% | |
| n | 234893 | 4.8% | |
| d | 211877 | 4.3% | |
| a | 201655 | 4.1% | |
| m | 172063 | 3.5% | |
| f | 147639 | 3.0% | |
| y | 104384 | 2.1% | |
| v | 79923 | 1.6% | |
| t | 76410 | 1.5% | |
| b | 76049 | 1.5% | |
| 1 | 75312 | 1.5% | |
| 8 | 75312 | 1.5% | |
| C | 65614 | 1.3% | |
| < | 54645 | 1.1% | |
| H | 53248 | 1.1% | |
| S | 47869 | 1.0% | |
| p | 42722 | 0.9% | |
| Other values (10) | 97617 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 3885632 | 78.8% | |
| Space Separator | 588150 | 11.9% | |
| Uppercase Letter | 232003 | 4.7% | |
| Decimal Number | 150624 | 3.1% | |
| Math Symbol | 74580 | 1.5% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| C | 65614 | 28.3% | |
| H | 53248 | 23.0% | |
| S | 47869 | 20.6% | |
| N | 35256 | 15.2% | |
| R | 13224 | 5.7% | |
| P | 6898 | 3.0% | |
| O | 6326 | 2.7% | |
| G | 3372 | 1.5% | |
| I | 196 | 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 446352 | 11.5% | |
| o | 423897 | 10.9% | |
| r | 357168 | 9.2% | |
| l | 300845 | 7.7% | |
| h | 258900 | 6.7% | |
| i | 257293 | 6.6% | |
| u | 244446 | 6.3% | |
| s | 236706 | 6.1% | |
| n | 234893 | 6.0% | |
| d | 211877 | 5.5% | |
| a | 201655 | 5.2% | |
| m | 172063 | 4.4% | |
| f | 147639 | 3.8% | |
| y | 104384 | 2.7% | |
| v | 79923 | 2.1% | |
| t | 76410 | 2.0% | |
| b | 76049 | 2.0% | |
| p | 42722 | 1.1% | |
| c | 12018 | 0.3% | |
| g | 196 | < 0.1% | |
| q | 196 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 588150 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 75312 | 50.0% | |
| 8 | 75312 | 50.0% |
Most frequent Math Symbol characters
| Value | Count | Frequency (%) | |
| < | 54645 | 73.3% | |
| + | 19935 | 26.7% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 4117635 | 83.5% | |
| Common | 813354 | 16.5% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 446352 | 10.8% | |
| o | 423897 | 10.3% | |
| r | 357168 | 8.7% | |
| l | 300845 | 7.3% | |
| h | 258900 | 6.3% | |
| i | 257293 | 6.2% | |
| u | 244446 | 5.9% | |
| s | 236706 | 5.7% | |
| n | 234893 | 5.7% | |
| d | 211877 | 5.1% | |
| a | 201655 | 4.9% | |
| m | 172063 | 4.2% | |
| f | 147639 | 3.6% | |
| y | 104384 | 2.5% | |
| v | 79923 | 1.9% | |
| t | 76410 | 1.9% | |
| b | 76049 | 1.8% | |
| C | 65614 | 1.6% | |
| H | 53248 | 1.3% | |
| S | 47869 | 1.2% | |
| p | 42722 | 1.0% | |
| N | 35256 | 0.9% | |
| R | 13224 | 0.3% | |
| c | 12018 | 0.3% | |
| P | 6898 | 0.2% | |
| Other values (5) | 10286 | 0.2% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 588150 | 72.3% | ||
| 1 | 75312 | 9.3% | |
| 8 | 75312 | 9.3% | |
| < | 54645 | 6.7% | |
| + | 19935 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 4930989 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 588150 | 11.9% | ||
| e | 446352 | 9.1% | |
| o | 423897 | 8.6% | |
| r | 357168 | 7.2% | |
| l | 300845 | 6.1% | |
| h | 258900 | 5.3% | |
| i | 257293 | 5.2% | |
| u | 244446 | 5.0% | |
| s | 236706 | 4.8% | |
| n | 234893 | 4.8% | |
| d | 211877 | 4.3% | |
| a | 201655 | 4.1% | |
| m | 172063 | 3.5% | |
| f | 147639 | 3.0% | |
| y | 104384 | 2.1% | |
| v | 79923 | 1.6% | |
| t | 76410 | 1.5% | |
| b | 76049 | 1.5% | |
| 1 | 75312 | 1.5% | |
| 8 | 75312 | 1.5% | |
| C | 65614 | 1.3% | |
| < | 54645 | 1.1% | |
| H | 53248 | 1.1% | |
| S | 47869 | 1.0% | |
| p | 42722 | 0.9% | |
| Other values (10) | 97617 | 2.0% |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Householder | |
|---|---|
| Child under 18 never married | |
| Spouse of householder | |
| Child 18 or older | |
| Other relative of householder | |
| Other values (3) |
| Value | Count | Frequency (%) | |
| Householder | 75475 | 37.8% | |
| Child under 18 never married | 50426 | 25.3% | |
| Spouse of householder | 41709 | 20.9% | |
| Child 18 or older | 14430 | 7.2% | |
| Other relative of householder | 9703 | 4.9% | |
| Nonrelative of householder | 7601 | 3.8% | |
| Group Quarters- Secondary individual | 132 | 0.1% | |
| Child under 18 ever married | 47 | < 0.1% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 36 |
|---|---|
| Median length | 21 |
| Mean length | 19.28793172 |
| Min length | 11 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 571582 | 14.9% | |
| o | 406423 | 10.6% | |
| r | 392775 | 10.2% | |
| 373307 | 9.7% | ||
| d | 315163 | 8.2% | |
| h | 268107 | 7.0% | |
| l | 231257 | 6.0% | |
| u | 227066 | 5.9% | |
| s | 176329 | 4.6% | |
| i | 133076 | 3.5% | |
| n | 108764 | 2.8% | |
| H | 75475 | 2.0% | |
| a | 68173 | 1.8% | |
| v | 67909 | 1.8% | |
| C | 64903 | 1.7% | |
| 1 | 64903 | 1.7% | |
| 8 | 64903 | 1.7% | |
| f | 59013 | 1.5% | |
| m | 50473 | 1.3% | |
| S | 41841 | 1.1% | |
| p | 41841 | 1.1% | |
| t | 27139 | 0.7% | |
| O | 9703 | 0.3% | |
| N | 7601 | 0.2% | |
| G | 132 | < 0.1% | |
| Other values (4) | 528 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 3145354 | 81.7% | |
| Space Separator | 373307 | 9.7% | |
| Uppercase Letter | 199787 | 5.2% | |
| Decimal Number | 129806 | 3.4% | |
| Dash Punctuation | 132 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| H | 75475 | 37.8% | |
| C | 64903 | 32.5% | |
| S | 41841 | 20.9% | |
| O | 9703 | 4.9% | |
| N | 7601 | 3.8% | |
| G | 132 | 0.1% | |
| Q | 132 | 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 571582 | 18.2% | |
| o | 406423 | 12.9% | |
| r | 392775 | 12.5% | |
| d | 315163 | 10.0% | |
| h | 268107 | 8.5% | |
| l | 231257 | 7.4% | |
| u | 227066 | 7.2% | |
| s | 176329 | 5.6% | |
| i | 133076 | 4.2% | |
| n | 108764 | 3.5% | |
| a | 68173 | 2.2% | |
| v | 67909 | 2.2% | |
| f | 59013 | 1.9% | |
| m | 50473 | 1.6% | |
| p | 41841 | 1.3% | |
| t | 27139 | 0.9% | |
| c | 132 | < 0.1% | |
| y | 132 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 373307 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 64903 | 50.0% | |
| 8 | 64903 | 50.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 132 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 3345141 | 86.9% | |
| Common | 503245 | 13.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 571582 | 17.1% | |
| o | 406423 | 12.1% | |
| r | 392775 | 11.7% | |
| d | 315163 | 9.4% | |
| h | 268107 | 8.0% | |
| l | 231257 | 6.9% | |
| u | 227066 | 6.8% | |
| s | 176329 | 5.3% | |
| i | 133076 | 4.0% | |
| n | 108764 | 3.3% | |
| H | 75475 | 2.3% | |
| a | 68173 | 2.0% | |
| v | 67909 | 2.0% | |
| C | 64903 | 1.9% | |
| f | 59013 | 1.8% | |
| m | 50473 | 1.5% | |
| S | 41841 | 1.3% | |
| p | 41841 | 1.3% | |
| t | 27139 | 0.8% | |
| O | 9703 | 0.3% | |
| N | 7601 | 0.2% | |
| G | 132 | < 0.1% | |
| Q | 132 | < 0.1% | |
| c | 132 | < 0.1% | |
| y | 132 | < 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 373307 | 74.2% | ||
| 1 | 64903 | 12.9% | |
| 8 | 64903 | 12.9% | |
| - | 132 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 3848386 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 571582 | 14.9% | |
| o | 406423 | 10.6% | |
| r | 392775 | 10.2% | |
| 373307 | 9.7% | ||
| d | 315163 | 8.2% | |
| h | 268107 | 7.0% | |
| l | 231257 | 6.0% | |
| u | 227066 | 5.9% | |
| s | 176329 | 4.6% | |
| i | 133076 | 3.5% | |
| n | 108764 | 2.8% | |
| H | 75475 | 2.0% | |
| a | 68173 | 1.8% | |
| v | 67909 | 1.8% | |
| C | 64903 | 1.7% | |
| 1 | 64903 | 1.7% | |
| 8 | 64903 | 1.7% | |
| f | 59013 | 1.5% | |
| m | 50473 | 1.3% | |
| S | 41841 | 1.1% | |
| p | 41841 | 1.1% | |
| t | 27139 | 0.7% | |
| O | 9703 | 0.3% | |
| N | 7601 | 0.2% | |
| G | 132 | < 0.1% | |
| Other values (4) | 528 | < 0.1% |
instance_weight
Real number (ℝ≥0)
| Distinct | 99800 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1740.380269 |
|---|---|
| Minimum | 37.87 |
| Maximum | 18656.3 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 37.87 |
|---|---|
| 5-th percentile | 395.342 |
| Q1 | 1061.615 |
| median | 1618.31 |
| Q3 | 2188.61 |
| 95-th percentile | 3585.909 |
| Maximum | 18656.3 |
| Range | 18618.43 |
| Interquartile range (IQR) | 1126.995 |
Descriptive statistics
| Standard deviation | 993.7681558 |
|---|---|
| Coefficient of variation (CV) | 0.5710063331 |
| Kurtosis | 5.412514036 |
| Mean | 1740.380269 |
| Median Absolute Deviation (MAD) | 561.46 |
| Skewness | 1.432733152 |
| Sum | 347245892.5 |
| Variance | 987575.1475 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1601.4 | 32 | < 0.1% | |
| 753.23 | 32 | < 0.1% | |
| 1191.21 | 32 | < 0.1% | |
| 1787.34 | 32 | < 0.1% | |
| 707.9 | 31 | < 0.1% | |
| 1317.51 | 31 | < 0.1% | |
| 1070.15 | 30 | < 0.1% | |
| 1839.19 | 28 | < 0.1% | |
| 1002.02 | 28 | < 0.1% | |
| 1009.39 | 28 | < 0.1% | |
| 1033.83 | 28 | < 0.1% | |
| 1029.73 | 27 | < 0.1% | |
| 1122.6 | 27 | < 0.1% | |
| 1528.84 | 27 | < 0.1% | |
| 964.5 | 26 | < 0.1% | |
| 1244.66 | 26 | < 0.1% | |
| 1011.71 | 26 | < 0.1% | |
| 1155.2 | 26 | < 0.1% | |
| 988.79 | 26 | < 0.1% | |
| 1882.96 | 26 | < 0.1% | |
| 974.01 | 26 | < 0.1% | |
| 1218.82 | 26 | < 0.1% | |
| 1138.19 | 25 | < 0.1% | |
| 1739.89 | 25 | < 0.1% | |
| 1032.82 | 25 | < 0.1% | |
| Other values (99775) | 198827 | 99.7% |
| Value | Count | Frequency (%) | |
| 37.87 | 1 | < 0.1% | |
| 39.11 | 1 | < 0.1% | |
| 40.67 | 2 | < 0.1% | |
| 42.82 | 2 | < 0.1% | |
| 43.26 | 3 | < 0.1% | |
| 45.74 | 2 | < 0.1% | |
| 47.83 | 6 | < 0.1% | |
| 49.82 | 2 | < 0.1% | |
| 52.43 | 1 | < 0.1% | |
| 52.46 | 4 | < 0.1% |
| Value | Count | Frequency (%) | |
| 18656.3 | 1 | < 0.1% | |
| 16349.2 | 1 | < 0.1% | |
| 13911.5 | 1 | < 0.1% | |
| 13145.1 | 1 | < 0.1% | |
| 13114.2 | 1 | < 0.1% | |
| 12960.2 | 1 | < 0.1% | |
| 12399.9 | 1 | < 0.1% | |
| 12184.5 | 1 | < 0.1% | |
| 11958.4 | 1 | < 0.1% | |
| 11863 | 1 | < 0.1% |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| ? | |
|---|---|
| Nonmover | |
| MSA to MSA | |
| NonMSA to nonMSA | 2811 |
| Not in universe | 1516 |
| Other values (5) | 2361 |
| Value | Count | Frequency (%) | |
| ? | 99696 | 50.0% | |
| Nonmover | 82538 | 41.4% | |
| MSA to MSA | 10601 | 5.3% | |
| NonMSA to nonMSA | 2811 | 1.4% | |
| Not in universe | 1516 | 0.8% | |
| MSA to nonMSA | 790 | 0.4% | |
| NonMSA to MSA | 615 | 0.3% | |
| Abroad to MSA | 453 | 0.2% | |
| Not identifiable | 430 | 0.2% | |
| Abroad to nonMSA | 73 | < 0.1% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 16 |
|---|---|
| Median length | 8 |
| Mean length | 4.841186229 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| o | 189991 | 19.7% | |
| ? | 99696 | 10.3% | |
| n | 96774 | 10.0% | |
| N | 87910 | 9.1% | |
| e | 86430 | 8.9% | |
| r | 84580 | 8.8% | |
| v | 84054 | 8.7% | |
| m | 82538 | 8.5% | |
| 34148 | 3.5% | ||
| A | 30686 | 3.2% | |
| M | 30160 | 3.1% | |
| S | 30160 | 3.1% | |
| t | 17719 | 1.8% | |
| i | 4322 | 0.4% | |
| u | 1516 | 0.2% | |
| s | 1516 | 0.2% | |
| d | 956 | 0.1% | |
| a | 956 | 0.1% | |
| b | 956 | 0.1% | |
| f | 430 | < 0.1% | |
| l | 430 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 653168 | 67.6% | |
| Uppercase Letter | 178916 | 18.5% | |
| Other Punctuation | 99696 | 10.3% | |
| Space Separator | 34148 | 3.5% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| ? | 99696 | 100.0% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 87910 | 49.1% | |
| A | 30686 | 17.2% | |
| M | 30160 | 16.9% | |
| S | 30160 | 16.9% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 34148 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| o | 189991 | 29.1% | |
| n | 96774 | 14.8% | |
| e | 86430 | 13.2% | |
| r | 84580 | 12.9% | |
| v | 84054 | 12.9% | |
| m | 82538 | 12.6% | |
| t | 17719 | 2.7% | |
| i | 4322 | 0.7% | |
| u | 1516 | 0.2% | |
| s | 1516 | 0.2% | |
| d | 956 | 0.1% | |
| a | 956 | 0.1% | |
| b | 956 | 0.1% | |
| f | 430 | 0.1% | |
| l | 430 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 832084 | 86.1% | |
| Common | 133844 | 13.9% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| ? | 99696 | 74.5% | |
| 34148 | 25.5% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| o | 189991 | 22.8% | |
| n | 96774 | 11.6% | |
| N | 87910 | 10.6% | |
| e | 86430 | 10.4% | |
| r | 84580 | 10.2% | |
| v | 84054 | 10.1% | |
| m | 82538 | 9.9% | |
| A | 30686 | 3.7% | |
| M | 30160 | 3.6% | |
| S | 30160 | 3.6% | |
| t | 17719 | 2.1% | |
| i | 4322 | 0.5% | |
| u | 1516 | 0.2% | |
| s | 1516 | 0.2% | |
| d | 956 | 0.1% | |
| a | 956 | 0.1% | |
| b | 956 | 0.1% | |
| f | 430 | 0.1% | |
| l | 430 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 965928 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| o | 189991 | 19.7% | |
| ? | 99696 | 10.3% | |
| n | 96774 | 10.0% | |
| N | 87910 | 9.1% | |
| e | 86430 | 8.9% | |
| r | 84580 | 8.8% | |
| v | 84054 | 8.7% | |
| m | 82538 | 8.5% | |
| 34148 | 3.5% | ||
| A | 30686 | 3.2% | |
| M | 30160 | 3.1% | |
| S | 30160 | 3.1% | |
| t | 17719 | 1.8% | |
| i | 4322 | 0.4% | |
| u | 1516 | 0.2% | |
| s | 1516 | 0.2% | |
| d | 956 | 0.1% | |
| a | 956 | 0.1% | |
| b | 956 | 0.1% | |
| f | 430 | < 0.1% | |
| l | 430 | < 0.1% |
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| ? | |
|---|---|
| Nonmover | |
| Same county | 9812 |
| Different county same state | 2797 |
| Not in universe | 1516 |
| Other values (4) | 3164 |
| Value | Count | Frequency (%) | |
| ? | 99696 | 50.0% | |
| Nonmover | 82538 | 41.4% | |
| Same county | 9812 | 4.9% | |
| Different county same state | 2797 | 1.4% | |
| Not in universe | 1516 | 0.8% | |
| Different region | 1178 | 0.6% | |
| Different state same division | 991 | 0.5% | |
| Abroad | 530 | 0.3% | |
| Different division same region | 465 | 0.2% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 30 |
|---|---|
| Median length | 6 |
| Mean length | 5.166862968 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| o | 182830 | 17.7% | |
| e | 115928 | 11.2% | |
| n | 106709 | 10.4% | |
| ? | 99696 | 9.7% | |
| m | 96603 | 9.4% | |
| r | 91658 | 8.9% | |
| v | 85510 | 8.3% | |
| N | 84054 | 8.2% | |
| t | 27132 | 2.6% | |
| 26781 | 2.6% | ||
| a | 18383 | 1.8% | |
| i | 14474 | 1.4% | |
| u | 14125 | 1.4% | |
| c | 12609 | 1.2% | |
| y | 12609 | 1.2% | |
| s | 11013 | 1.1% | |
| f | 10862 | 1.1% | |
| S | 9812 | 1.0% | |
| D | 5431 | 0.5% | |
| d | 1986 | 0.2% | |
| g | 1643 | 0.2% | |
| A | 530 | 0.1% | |
| b | 530 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 804604 | 78.0% | |
| Uppercase Letter | 99827 | 9.7% | |
| Other Punctuation | 99696 | 9.7% | |
| Space Separator | 26781 | 2.6% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| ? | 99696 | 100.0% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 84054 | 84.2% | |
| S | 9812 | 9.8% | |
| D | 5431 | 5.4% | |
| A | 530 | 0.5% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| o | 182830 | 22.7% | |
| e | 115928 | 14.4% | |
| n | 106709 | 13.3% | |
| m | 96603 | 12.0% | |
| r | 91658 | 11.4% | |
| v | 85510 | 10.6% | |
| t | 27132 | 3.4% | |
| a | 18383 | 2.3% | |
| i | 14474 | 1.8% | |
| u | 14125 | 1.8% | |
| c | 12609 | 1.6% | |
| y | 12609 | 1.6% | |
| s | 11013 | 1.4% | |
| f | 10862 | 1.3% | |
| d | 1986 | 0.2% | |
| g | 1643 | 0.2% | |
| b | 530 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 26781 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 904431 | 87.7% | |
| Common | 126477 | 12.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| ? | 99696 | 78.8% | |
| 26781 | 21.2% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| o | 182830 | 20.2% | |
| e | 115928 | 12.8% | |
| n | 106709 | 11.8% | |
| m | 96603 | 10.7% | |
| r | 91658 | 10.1% | |
| v | 85510 | 9.5% | |
| N | 84054 | 9.3% | |
| t | 27132 | 3.0% | |
| a | 18383 | 2.0% | |
| i | 14474 | 1.6% | |
| u | 14125 | 1.6% | |
| c | 12609 | 1.4% | |
| y | 12609 | 1.4% | |
| s | 11013 | 1.2% | |
| f | 10862 | 1.2% | |
| S | 9812 | 1.1% | |
| D | 5431 | 0.6% | |
| d | 1986 | 0.2% | |
| g | 1643 | 0.2% | |
| A | 530 | 0.1% | |
| b | 530 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1030908 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| o | 182830 | 17.7% | |
| e | 115928 | 11.2% | |
| n | 106709 | 10.4% | |
| ? | 99696 | 9.7% | |
| m | 96603 | 9.4% | |
| r | 91658 | 8.9% | |
| v | 85510 | 8.3% | |
| N | 84054 | 8.2% | |
| t | 27132 | 2.6% | |
| 26781 | 2.6% | ||
| a | 18383 | 1.8% | |
| i | 14474 | 1.4% | |
| u | 14125 | 1.4% | |
| c | 12609 | 1.2% | |
| y | 12609 | 1.2% | |
| s | 11013 | 1.1% | |
| f | 10862 | 1.1% | |
| S | 9812 | 1.0% | |
| D | 5431 | 0.5% | |
| d | 1986 | 0.2% | |
| g | 1643 | 0.2% | |
| A | 530 | 0.1% | |
| b | 530 | 0.1% |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| ? | |
|---|---|
| Nonmover | |
| Same county | 9812 |
| Different county same state | 2797 |
| Not in universe | 1516 |
| Other values (5) | 3164 |
| Value | Count | Frequency (%) | |
| ? | 99696 | 50.0% | |
| Nonmover | 82538 | 41.4% | |
| Same county | 9812 | 4.9% | |
| Different county same state | 2797 | 1.4% | |
| Not in universe | 1516 | 0.8% | |
| Different state in South | 973 | 0.5% | |
| Different state in West | 679 | 0.3% | |
| Different state in Midwest | 551 | 0.3% | |
| Abroad | 530 | 0.3% | |
| Different state in Northeast | 431 | 0.2% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 28 |
|---|---|
| Median length | 6 |
| Mean length | 5.186038702 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| o | 181135 | 17.5% | |
| e | 116133 | 11.2% | |
| n | 106244 | 10.3% | |
| ? | 99696 | 9.6% | |
| m | 95147 | 9.2% | |
| r | 90446 | 8.7% | |
| N | 84485 | 8.2% | |
| v | 84054 | 8.1% | |
| t | 33483 | 3.2% | |
| 29137 | 2.8% | ||
| a | 19001 | 1.8% | |
| u | 15098 | 1.5% | |
| c | 12609 | 1.2% | |
| y | 12609 | 1.2% | |
| i | 11648 | 1.1% | |
| s | 11405 | 1.1% | |
| f | 10862 | 1.0% | |
| S | 10785 | 1.0% | |
| D | 5431 | 0.5% | |
| h | 1404 | 0.1% | |
| d | 1081 | 0.1% | |
| W | 679 | 0.1% | |
| M | 551 | 0.1% | |
| w | 551 | 0.1% | |
| A | 530 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 803440 | 77.6% | |
| Uppercase Letter | 102461 | 9.9% | |
| Other Punctuation | 99696 | 9.6% | |
| Space Separator | 29137 | 2.8% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| ? | 99696 | 100.0% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 84485 | 82.5% | |
| S | 10785 | 10.5% | |
| D | 5431 | 5.3% | |
| W | 679 | 0.7% | |
| M | 551 | 0.5% | |
| A | 530 | 0.5% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| o | 181135 | 22.5% | |
| e | 116133 | 14.5% | |
| n | 106244 | 13.2% | |
| m | 95147 | 11.8% | |
| r | 90446 | 11.3% | |
| v | 84054 | 10.5% | |
| t | 33483 | 4.2% | |
| a | 19001 | 2.4% | |
| u | 15098 | 1.9% | |
| c | 12609 | 1.6% | |
| y | 12609 | 1.6% | |
| i | 11648 | 1.4% | |
| s | 11405 | 1.4% | |
| f | 10862 | 1.4% | |
| h | 1404 | 0.2% | |
| d | 1081 | 0.1% | |
| w | 551 | 0.1% | |
| b | 530 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 29137 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 905901 | 87.5% | |
| Common | 128833 | 12.5% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| ? | 99696 | 77.4% | |
| 29137 | 22.6% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| o | 181135 | 20.0% | |
| e | 116133 | 12.8% | |
| n | 106244 | 11.7% | |
| m | 95147 | 10.5% | |
| r | 90446 | 10.0% | |
| N | 84485 | 9.3% | |
| v | 84054 | 9.3% | |
| t | 33483 | 3.7% | |
| a | 19001 | 2.1% | |
| u | 15098 | 1.7% | |
| c | 12609 | 1.4% | |
| y | 12609 | 1.4% | |
| i | 11648 | 1.3% | |
| s | 11405 | 1.3% | |
| f | 10862 | 1.2% | |
| S | 10785 | 1.2% | |
| D | 5431 | 0.6% | |
| h | 1404 | 0.2% | |
| d | 1081 | 0.1% | |
| W | 679 | 0.1% | |
| M | 551 | 0.1% | |
| w | 551 | 0.1% | |
| A | 530 | 0.1% | |
| b | 530 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1034734 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| o | 181135 | 17.5% | |
| e | 116133 | 11.2% | |
| n | 106244 | 10.3% | |
| ? | 99696 | 9.6% | |
| m | 95147 | 9.2% | |
| r | 90446 | 8.7% | |
| N | 84485 | 8.2% | |
| v | 84054 | 8.1% | |
| t | 33483 | 3.2% | |
| 29137 | 2.8% | ||
| a | 19001 | 1.8% | |
| u | 15098 | 1.5% | |
| c | 12609 | 1.2% | |
| y | 12609 | 1.2% | |
| i | 11648 | 1.1% | |
| s | 11405 | 1.1% | |
| f | 10862 | 1.0% | |
| S | 10785 | 1.0% | |
| D | 5431 | 0.5% | |
| h | 1404 | 0.1% | |
| d | 1081 | 0.1% | |
| W | 679 | 0.1% | |
| M | 551 | 0.1% | |
| w | 551 | 0.1% | |
| A | 530 | 0.1% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe under 1 year old | |
|---|---|
| Yes | |
| No |
| Value | Count | Frequency (%) | |
| Not in universe under 1 year old | 101212 | 50.7% | |
| Yes | 82538 | 41.4% | |
| No | 15773 | 7.9% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 17.63177178 |
| Min length | 2 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 607272 | 17.3% | ||
| e | 487386 | 13.9% | |
| n | 303636 | 8.6% | |
| r | 303636 | 8.6% | |
| o | 218197 | 6.2% | |
| i | 202424 | 5.8% | |
| u | 202424 | 5.8% | |
| d | 202424 | 5.8% | |
| s | 183750 | 5.2% | |
| N | 116985 | 3.3% | |
| t | 101212 | 2.9% | |
| v | 101212 | 2.9% | |
| 1 | 101212 | 2.9% | |
| y | 101212 | 2.9% | |
| a | 101212 | 2.9% | |
| l | 101212 | 2.9% | |
| Y | 82538 | 2.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 2609937 | 74.2% | |
| Space Separator | 607272 | 17.3% | |
| Uppercase Letter | 199523 | 5.7% | |
| Decimal Number | 101212 | 2.9% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 116985 | 58.6% | |
| Y | 82538 | 41.4% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 487386 | 18.7% | |
| n | 303636 | 11.6% | |
| r | 303636 | 11.6% | |
| o | 218197 | 8.4% | |
| i | 202424 | 7.8% | |
| u | 202424 | 7.8% | |
| d | 202424 | 7.8% | |
| s | 183750 | 7.0% | |
| t | 101212 | 3.9% | |
| v | 101212 | 3.9% | |
| y | 101212 | 3.9% | |
| a | 101212 | 3.9% | |
| l | 101212 | 3.9% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 607272 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 101212 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 2809460 | 79.9% | |
| Common | 708484 | 20.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 487386 | 17.3% | |
| n | 303636 | 10.8% | |
| r | 303636 | 10.8% | |
| o | 218197 | 7.8% | |
| i | 202424 | 7.2% | |
| u | 202424 | 7.2% | |
| d | 202424 | 7.2% | |
| s | 183750 | 6.5% | |
| N | 116985 | 4.2% | |
| t | 101212 | 3.6% | |
| v | 101212 | 3.6% | |
| y | 101212 | 3.6% | |
| a | 101212 | 3.6% | |
| l | 101212 | 3.6% | |
| Y | 82538 | 2.9% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 607272 | 85.7% | ||
| 1 | 101212 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 3517944 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 607272 | 17.3% | ||
| e | 487386 | 13.9% | |
| n | 303636 | 8.6% | |
| r | 303636 | 8.6% | |
| o | 218197 | 6.2% | |
| i | 202424 | 5.8% | |
| u | 202424 | 5.8% | |
| d | 202424 | 5.8% | |
| s | 183750 | 5.2% | |
| N | 116985 | 3.3% | |
| t | 101212 | 2.9% | |
| v | 101212 | 2.9% | |
| 1 | 101212 | 2.9% | |
| y | 101212 | 2.9% | |
| a | 101212 | 2.9% | |
| l | 101212 | 2.9% | |
| Y | 82538 | 2.3% |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| ? | |
|---|---|
| Not in universe | |
| No | |
| Yes | 5786 |
| Value | Count | Frequency (%) | |
| ? | 99696 | 50.0% | |
| Not in universe | 84054 | 42.1% | |
| No | 9987 | 5.0% | |
| Yes | 5786 | 2.9% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 15 |
|---|---|
| Median length | 2 |
| Mean length | 7.005899069 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 173894 | 12.4% | |
| 168108 | 12.0% | ||
| i | 168108 | 12.0% | |
| n | 168108 | 12.0% | |
| ? | 99696 | 7.1% | |
| N | 94041 | 6.7% | |
| o | 94041 | 6.7% | |
| s | 89840 | 6.4% | |
| t | 84054 | 6.0% | |
| u | 84054 | 6.0% | |
| v | 84054 | 6.0% | |
| r | 84054 | 6.0% | |
| Y | 5786 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1030207 | 73.7% | |
| Space Separator | 168108 | 12.0% | |
| Uppercase Letter | 99827 | 7.1% | |
| Other Punctuation | 99696 | 7.1% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| ? | 99696 | 100.0% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 94041 | 94.2% | |
| Y | 5786 | 5.8% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 173894 | 16.9% | |
| i | 168108 | 16.3% | |
| n | 168108 | 16.3% | |
| o | 94041 | 9.1% | |
| s | 89840 | 8.7% | |
| t | 84054 | 8.2% | |
| u | 84054 | 8.2% | |
| v | 84054 | 8.2% | |
| r | 84054 | 8.2% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 168108 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1130034 | 80.8% | |
| Common | 267804 | 19.2% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 168108 | 62.8% | ||
| ? | 99696 | 37.2% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 173894 | 15.4% | |
| i | 168108 | 14.9% | |
| n | 168108 | 14.9% | |
| N | 94041 | 8.3% | |
| o | 94041 | 8.3% | |
| s | 89840 | 8.0% | |
| t | 84054 | 7.4% | |
| u | 84054 | 7.4% | |
| v | 84054 | 7.4% | |
| r | 84054 | 7.4% | |
| Y | 5786 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1397838 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 173894 | 12.4% | |
| 168108 | 12.0% | ||
| i | 168108 | 12.0% | |
| n | 168108 | 12.0% | |
| ? | 99696 | 7.1% | |
| N | 94041 | 6.7% | |
| o | 94041 | 6.7% | |
| s | 89840 | 6.4% | |
| t | 84054 | 6.0% | |
| u | 84054 | 6.0% | |
| v | 84054 | 6.0% | |
| r | 84054 | 6.0% | |
| Y | 5786 | 0.4% |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.95618049 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 95983 |
| Zeros (%) | 48.1% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.365125505 |
|---|---|
| Coefficient of variation (CV) | 1.209052803 |
| Kurtosis | -1.082246833 |
| Mean | 1.95618049 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.7515606804 |
| Sum | 390303 |
| Variance | 5.593818657 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 95983 | 48.1% | |
| 6 | 36511 | 18.3% | |
| 1 | 23109 | 11.6% | |
| 4 | 14379 | 7.2% | |
| 3 | 13425 | 6.7% | |
| 2 | 10081 | 5.1% | |
| 5 | 6035 | 3.0% |
| Value | Count | Frequency (%) | |
| 0 | 95983 | 48.1% | |
| 1 | 23109 | 11.6% | |
| 2 | 10081 | 5.1% | |
| 3 | 13425 | 6.7% | |
| 4 | 14379 | 7.2% | |
| 5 | 6035 | 3.0% | |
| 6 | 36511 | 18.3% |
| Value | Count | Frequency (%) | |
| 6 | 36511 | 18.3% | |
| 5 | 6035 | 3.0% | |
| 4 | 14379 | 7.2% | |
| 3 | 13425 | 6.7% | |
| 2 | 10081 | 5.1% | |
| 1 | 23109 | 11.6% | |
| 0 | 95983 | 48.1% |
family_members_under_18
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe | |
|---|---|
| Both parents present | |
| Mother only present | 12772 |
| Father only present | 1883 |
| Neither parent present | 1653 |
| Value | Count | Frequency (%) | |
| Not in universe | 144232 | 72.3% | |
| Both parents present | 38983 | 19.5% | |
| Mother only present | 12772 | 6.4% | |
| Father only present | 1883 | 0.9% | |
| Neither parent present | 1653 | 0.8% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 22 |
|---|---|
| Median length | 15 |
| Mean length | 16.32869895 |
| Min length | 15 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 457643 | 14.0% | |
| 399046 | 12.2% | ||
| n | 399046 | 12.2% | |
| t | 295450 | 9.1% | |
| i | 290117 | 8.9% | |
| r | 256467 | 7.9% | |
| s | 238506 | 7.3% | |
| o | 210642 | 6.5% | |
| N | 145885 | 4.5% | |
| u | 144232 | 4.4% | |
| v | 144232 | 4.4% | |
| p | 95927 | 2.9% | |
| h | 55291 | 1.7% | |
| a | 42519 | 1.3% | |
| B | 38983 | 1.2% | |
| l | 14655 | 0.4% | |
| y | 14655 | 0.4% | |
| M | 12772 | 0.4% | |
| F | 1883 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 2659382 | 81.6% | |
| Space Separator | 399046 | 12.2% | |
| Uppercase Letter | 199523 | 6.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 145885 | 73.1% | |
| B | 38983 | 19.5% | |
| M | 12772 | 6.4% | |
| F | 1883 | 0.9% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 457643 | 17.2% | |
| n | 399046 | 15.0% | |
| t | 295450 | 11.1% | |
| i | 290117 | 10.9% | |
| r | 256467 | 9.6% | |
| s | 238506 | 9.0% | |
| o | 210642 | 7.9% | |
| u | 144232 | 5.4% | |
| v | 144232 | 5.4% | |
| p | 95927 | 3.6% | |
| h | 55291 | 2.1% | |
| a | 42519 | 1.6% | |
| l | 14655 | 0.6% | |
| y | 14655 | 0.6% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 399046 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 2858905 | 87.8% | |
| Common | 399046 | 12.2% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 457643 | 16.0% | |
| n | 399046 | 14.0% | |
| t | 295450 | 10.3% | |
| i | 290117 | 10.1% | |
| r | 256467 | 9.0% | |
| s | 238506 | 8.3% | |
| o | 210642 | 7.4% | |
| N | 145885 | 5.1% | |
| u | 144232 | 5.0% | |
| v | 144232 | 5.0% | |
| p | 95927 | 3.4% | |
| h | 55291 | 1.9% | |
| a | 42519 | 1.5% | |
| B | 38983 | 1.4% | |
| l | 14655 | 0.5% | |
| y | 14655 | 0.5% | |
| M | 12772 | 0.4% | |
| F | 1883 | 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 399046 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 3257951 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 457643 | 14.0% | |
| 399046 | 12.2% | ||
| n | 399046 | 12.2% | |
| t | 295450 | 9.1% | |
| i | 290117 | 8.9% | |
| r | 256467 | 7.9% | |
| s | 238506 | 7.3% | |
| o | 210642 | 6.5% | |
| N | 145885 | 4.5% | |
| u | 144232 | 4.4% | |
| v | 144232 | 4.4% | |
| p | 95927 | 2.9% | |
| h | 55291 | 1.7% | |
| a | 42519 | 1.3% | |
| B | 38983 | 1.2% | |
| l | 14655 | 0.4% | |
| y | 14655 | 0.4% | |
| M | 12772 | 0.4% | |
| F | 1883 | 0.1% |
country_of_birth_father
Categorical
| Distinct | 43 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| United-States | |
|---|---|
| Mexico | 10008 |
| ? | 6713 |
| Puerto-Rico | 2680 |
| Italy | 2212 |
| Other values (38) |
| Value | Count | Frequency (%) | |
| United-States | 159163 | 79.8% | |
| Mexico | 10008 | 5.0% | |
| ? | 6713 | 3.4% | |
| Puerto-Rico | 2680 | 1.3% | |
| Italy | 2212 | 1.1% | |
| Canada | 1380 | 0.7% | |
| Germany | 1356 | 0.7% | |
| Dominican-Republic | 1290 | 0.6% | |
| Poland | 1212 | 0.6% | |
| Philippines | 1154 | 0.6% | |
| Cuba | 1125 | 0.6% | |
| El-Salvador | 982 | 0.5% | |
| China | 856 | 0.4% | |
| England | 793 | 0.4% | |
| Columbia | 614 | 0.3% | |
| India | 580 | 0.3% | |
| South Korea | 530 | 0.3% | |
| Ireland | 508 | 0.3% | |
| Jamaica | 463 | 0.2% | |
| Vietnam | 457 | 0.2% | |
| Guatemala | 445 | 0.2% | |
| Japan | 392 | 0.2% | |
| Portugal | 388 | 0.2% | |
| Ecuador | 379 | 0.2% | |
| Haiti | 351 | 0.2% | |
| Other values (18) | 3492 | 1.8% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 28 |
|---|---|
| Median length | 13 |
| Mean length | 11.66875999 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| t | 485168 | 20.8% | |
| e | 338573 | 14.5% | |
| a | 185809 | 8.0% | |
| i | 184161 | 7.9% | |
| n | 173312 | 7.4% | |
| d | 166069 | 7.1% | |
| - | 164325 | 7.1% | |
| S | 161240 | 6.9% | |
| s | 160933 | 6.9% | |
| U | 159481 | 6.9% | |
| o | 22790 | 1.0% | |
| c | 17366 | 0.7% | |
| l | 11412 | 0.5% | |
| M | 10008 | 0.4% | |
| x | 10008 | 0.4% | |
| u | 9136 | 0.4% | |
| r | 8905 | 0.4% | |
| ? | 6713 | 0.3% | |
| P | 5794 | 0.2% | |
| m | 5005 | 0.2% | |
| C | 4171 | 0.2% | |
| y | 4033 | 0.2% | |
| p | 3990 | 0.2% | |
| R | 3970 | 0.2% | |
| I | 3692 | 0.2% | |
| Other values (22) | 22122 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1796607 | 77.2% | |
| Uppercase Letter | 358838 | 15.4% | |
| Dash Punctuation | 164325 | 7.1% | |
| Other Punctuation | 6826 | 0.3% | |
| Space Separator | 1272 | 0.1% | |
| Open Punctuation | 159 | < 0.1% | |
| Close Punctuation | 159 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| S | 161240 | 44.9% | |
| U | 159481 | 44.4% | |
| M | 10008 | 2.8% | |
| P | 5794 | 1.6% | |
| C | 4171 | 1.2% | |
| R | 3970 | 1.1% | |
| I | 3692 | 1.0% | |
| G | 2304 | 0.6% | |
| E | 2154 | 0.6% | |
| D | 1290 | 0.4% | |
| H | 1008 | 0.3% | |
| J | 855 | 0.2% | |
| K | 636 | 0.2% | |
| V | 616 | 0.2% | |
| T | 532 | 0.1% | |
| N | 366 | 0.1% | |
| Y | 217 | 0.1% | |
| F | 191 | 0.1% | |
| O | 159 | < 0.1% | |
| L | 154 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| t | 485168 | 27.0% | |
| e | 338573 | 18.8% | |
| a | 185809 | 10.3% | |
| i | 184161 | 10.3% | |
| n | 173312 | 9.6% | |
| d | 166069 | 9.2% | |
| s | 160933 | 9.0% | |
| o | 22790 | 1.3% | |
| c | 17366 | 1.0% | |
| l | 11412 | 0.6% | |
| x | 10008 | 0.6% | |
| u | 9136 | 0.5% | |
| r | 8905 | 0.5% | |
| m | 5005 | 0.3% | |
| y | 4033 | 0.2% | |
| p | 3990 | 0.2% | |
| b | 3338 | 0.2% | |
| h | 2698 | 0.2% | |
| g | 2503 | 0.1% | |
| v | 1199 | 0.1% | |
| w | 199 | < 0.1% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 164325 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| ? | 6713 | 98.3% | |
| & | 113 | 1.7% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 1272 | 100.0% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 159 | 100.0% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 159 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 2155445 | 92.6% | |
| Common | 172741 | 7.4% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| t | 485168 | 22.5% | |
| e | 338573 | 15.7% | |
| a | 185809 | 8.6% | |
| i | 184161 | 8.5% | |
| n | 173312 | 8.0% | |
| d | 166069 | 7.7% | |
| S | 161240 | 7.5% | |
| s | 160933 | 7.5% | |
| U | 159481 | 7.4% | |
| o | 22790 | 1.1% | |
| c | 17366 | 0.8% | |
| l | 11412 | 0.5% | |
| M | 10008 | 0.5% | |
| x | 10008 | 0.5% | |
| u | 9136 | 0.4% | |
| r | 8905 | 0.4% | |
| P | 5794 | 0.3% | |
| m | 5005 | 0.2% | |
| C | 4171 | 0.2% | |
| y | 4033 | 0.2% | |
| p | 3990 | 0.2% | |
| R | 3970 | 0.2% | |
| I | 3692 | 0.2% | |
| b | 3338 | 0.2% | |
| h | 2698 | 0.1% | |
| Other values (16) | 14383 | 0.7% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| - | 164325 | 95.1% | |
| ? | 6713 | 3.9% | |
| 1272 | 0.7% | ||
| ( | 159 | 0.1% | |
| ) | 159 | 0.1% | |
| & | 113 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 2328186 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| t | 485168 | 20.8% | |
| e | 338573 | 14.5% | |
| a | 185809 | 8.0% | |
| i | 184161 | 7.9% | |
| n | 173312 | 7.4% | |
| d | 166069 | 7.1% | |
| - | 164325 | 7.1% | |
| S | 161240 | 6.9% | |
| s | 160933 | 6.9% | |
| U | 159481 | 6.9% | |
| o | 22790 | 1.0% | |
| c | 17366 | 0.7% | |
| l | 11412 | 0.5% | |
| M | 10008 | 0.4% | |
| x | 10008 | 0.4% | |
| u | 9136 | 0.4% | |
| r | 8905 | 0.4% | |
| ? | 6713 | 0.3% | |
| P | 5794 | 0.2% | |
| m | 5005 | 0.2% | |
| C | 4171 | 0.2% | |
| y | 4033 | 0.2% | |
| p | 3990 | 0.2% | |
| R | 3970 | 0.2% | |
| I | 3692 | 0.2% | |
| Other values (22) | 22122 | 1.0% |
country_of_birth_mother
Categorical
| Distinct | 43 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| United-States | |
|---|---|
| Mexico | 9781 |
| ? | 6119 |
| Puerto-Rico | 2473 |
| Italy | 1844 |
| Other values (38) |
| Value | Count | Frequency (%) | |
| United-States | 160479 | 80.4% | |
| Mexico | 9781 | 4.9% | |
| ? | 6119 | 3.1% | |
| Puerto-Rico | 2473 | 1.2% | |
| Italy | 1844 | 0.9% | |
| Canada | 1451 | 0.7% | |
| Germany | 1382 | 0.7% | |
| Philippines | 1231 | 0.6% | |
| Poland | 1110 | 0.6% | |
| Cuba | 1108 | 0.6% | |
| El-Salvador | 1108 | 0.6% | |
| Dominican-Republic | 1103 | 0.6% | |
| England | 903 | 0.5% | |
| China | 760 | 0.4% | |
| Columbia | 612 | 0.3% | |
| South Korea | 609 | 0.3% | |
| Ireland | 599 | 0.3% | |
| India | 581 | 0.3% | |
| Vietnam | 473 | 0.2% | |
| Japan | 469 | 0.2% | |
| Jamaica | 453 | 0.2% | |
| Guatemala | 444 | 0.2% | |
| Ecuador | 375 | 0.2% | |
| Peru | 355 | 0.2% | |
| Haiti | 353 | 0.2% | |
| Other values (18) | 3348 | 1.7% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 28 |
|---|---|
| Median length | 13 |
| Mean length | 11.72127023 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| t | 488579 | 20.9% | |
| e | 340658 | 14.6% | |
| a | 187061 | 8.0% | |
| i | 184556 | 7.9% | |
| n | 174658 | 7.5% | |
| d | 167641 | 7.2% | |
| - | 165369 | 7.1% | |
| S | 162751 | 7.0% | |
| s | 162309 | 6.9% | |
| U | 160793 | 6.9% | |
| o | 22004 | 0.9% | |
| c | 16460 | 0.7% | |
| l | 11200 | 0.5% | |
| M | 9781 | 0.4% | |
| x | 9781 | 0.4% | |
| r | 8878 | 0.4% | |
| u | 8728 | 0.4% | |
| ? | 6119 | 0.3% | |
| P | 5543 | 0.2% | |
| m | 4813 | 0.2% | |
| C | 4088 | 0.2% | |
| p | 4034 | 0.2% | |
| y | 3680 | 0.2% | |
| R | 3576 | 0.2% | |
| I | 3379 | 0.1% | |
| Other values (22) | 22224 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1804888 | 77.2% | |
| Uppercase Letter | 360530 | 15.4% | |
| Dash Punctuation | 165369 | 7.1% | |
| Other Punctuation | 6218 | 0.3% | |
| Space Separator | 1344 | 0.1% | |
| Open Punctuation | 157 | < 0.1% | |
| Close Punctuation | 157 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| S | 162751 | 45.1% | |
| U | 160793 | 44.6% | |
| M | 9781 | 2.7% | |
| P | 5543 | 1.5% | |
| C | 4088 | 1.1% | |
| R | 3576 | 1.0% | |
| I | 3379 | 0.9% | |
| E | 2386 | 0.7% | |
| G | 2244 | 0.6% | |
| D | 1103 | 0.3% | |
| H | 1024 | 0.3% | |
| J | 922 | 0.3% | |
| K | 716 | 0.2% | |
| V | 630 | 0.2% | |
| T | 543 | 0.2% | |
| N | 350 | 0.1% | |
| F | 212 | 0.1% | |
| Y | 177 | < 0.1% | |
| O | 157 | < 0.1% | |
| L | 155 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| t | 488579 | 27.1% | |
| e | 340658 | 18.9% | |
| a | 187061 | 10.4% | |
| i | 184556 | 10.2% | |
| n | 174658 | 9.7% | |
| d | 167641 | 9.3% | |
| s | 162309 | 9.0% | |
| o | 22004 | 1.2% | |
| c | 16460 | 0.9% | |
| l | 11200 | 0.6% | |
| x | 9781 | 0.5% | |
| r | 8878 | 0.5% | |
| u | 8728 | 0.5% | |
| m | 4813 | 0.3% | |
| p | 4034 | 0.2% | |
| y | 3680 | 0.2% | |
| b | 3079 | 0.2% | |
| h | 2772 | 0.2% | |
| g | 2490 | 0.1% | |
| v | 1285 | 0.1% | |
| w | 222 | < 0.1% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 165369 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| ? | 6119 | 98.4% | |
| & | 99 | 1.6% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 1344 | 100.0% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 157 | 100.0% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 157 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 2165418 | 92.6% | |
| Common | 173245 | 7.4% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| t | 488579 | 22.6% | |
| e | 340658 | 15.7% | |
| a | 187061 | 8.6% | |
| i | 184556 | 8.5% | |
| n | 174658 | 8.1% | |
| d | 167641 | 7.7% | |
| S | 162751 | 7.5% | |
| s | 162309 | 7.5% | |
| U | 160793 | 7.4% | |
| o | 22004 | 1.0% | |
| c | 16460 | 0.8% | |
| l | 11200 | 0.5% | |
| M | 9781 | 0.5% | |
| x | 9781 | 0.5% | |
| r | 8878 | 0.4% | |
| u | 8728 | 0.4% | |
| P | 5543 | 0.3% | |
| m | 4813 | 0.2% | |
| C | 4088 | 0.2% | |
| p | 4034 | 0.2% | |
| y | 3680 | 0.2% | |
| R | 3576 | 0.2% | |
| I | 3379 | 0.2% | |
| b | 3079 | 0.1% | |
| h | 2772 | 0.1% | |
| Other values (16) | 14616 | 0.7% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| - | 165369 | 95.5% | |
| ? | 6119 | 3.5% | |
| 1344 | 0.8% | ||
| ( | 157 | 0.1% | |
| ) | 157 | 0.1% | |
| & | 99 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 2338663 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| t | 488579 | 20.9% | |
| e | 340658 | 14.6% | |
| a | 187061 | 8.0% | |
| i | 184556 | 7.9% | |
| n | 174658 | 7.5% | |
| d | 167641 | 7.2% | |
| - | 165369 | 7.1% | |
| S | 162751 | 7.0% | |
| s | 162309 | 6.9% | |
| U | 160793 | 6.9% | |
| o | 22004 | 0.9% | |
| c | 16460 | 0.7% | |
| l | 11200 | 0.5% | |
| M | 9781 | 0.4% | |
| x | 9781 | 0.4% | |
| r | 8878 | 0.4% | |
| u | 8728 | 0.4% | |
| ? | 6119 | 0.3% | |
| P | 5543 | 0.2% | |
| m | 4813 | 0.2% | |
| C | 4088 | 0.2% | |
| p | 4034 | 0.2% | |
| y | 3680 | 0.2% | |
| R | 3576 | 0.2% | |
| I | 3379 | 0.1% | |
| Other values (22) | 22224 | 1.0% |
country_of_birth_self
Categorical
| Distinct | 43 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| United-States | |
|---|---|
| Mexico | 5767 |
| ? | 3393 |
| Puerto-Rico | 1400 |
| Germany | 851 |
| Other values (38) | 11123 |
| Value | Count | Frequency (%) | |
| United-States | 176989 | 88.7% | |
| Mexico | 5767 | 2.9% | |
| ? | 3393 | 1.7% | |
| Puerto-Rico | 1400 | 0.7% | |
| Germany | 851 | 0.4% | |
| Philippines | 845 | 0.4% | |
| Cuba | 837 | 0.4% | |
| Canada | 700 | 0.4% | |
| Dominican-Republic | 690 | 0.3% | |
| El-Salvador | 689 | 0.3% | |
| China | 478 | 0.2% | |
| South Korea | 471 | 0.2% | |
| England | 457 | 0.2% | |
| Columbia | 434 | 0.2% | |
| Italy | 419 | 0.2% | |
| India | 408 | 0.2% | |
| Vietnam | 391 | 0.2% | |
| Poland | 381 | 0.2% | |
| Guatemala | 344 | 0.2% | |
| Japan | 339 | 0.2% | |
| Jamaica | 320 | 0.2% | |
| Peru | 268 | 0.1% | |
| Ecuador | 258 | 0.1% | |
| Haiti | 228 | 0.1% | |
| Nicaragua | 218 | 0.1% | |
| Other values (18) | 1948 | 1.0% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 28 |
|---|---|
| Median length | 13 |
| Mean length | 12.27975722 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| t | 534730 | 21.8% | |
| e | 365867 | 14.9% | |
| a | 192481 | 7.9% | |
| i | 192126 | 7.8% | |
| n | 185160 | 7.6% | |
| d | 180622 | 7.4% | |
| - | 179910 | 7.3% | |
| S | 178462 | 7.3% | |
| s | 178172 | 7.3% | |
| U | 177227 | 7.2% | |
| o | 12975 | 0.5% | |
| c | 9805 | 0.4% | |
| M | 5767 | 0.2% | |
| x | 5767 | 0.2% | |
| l | 5676 | 0.2% | |
| u | 5621 | 0.2% | |
| r | 5201 | 0.2% | |
| ? | 3393 | 0.1% | |
| m | 3272 | 0.1% | |
| P | 3096 | 0.1% | |
| p | 2719 | 0.1% | |
| C | 2544 | 0.1% | |
| b | 2122 | 0.1% | |
| R | 2090 | 0.1% | |
| h | 1930 | 0.1% | |
| Other values (22) | 13359 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1888049 | 77.1% | |
| Uppercase Letter | 377391 | 15.4% | |
| Dash Punctuation | 179910 | 7.3% | |
| Other Punctuation | 3459 | 0.1% | |
| Space Separator | 1047 | < 0.1% | |
| Open Punctuation | 119 | < 0.1% | |
| Close Punctuation | 119 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| S | 178462 | 47.3% | |
| U | 177227 | 47.0% | |
| M | 5767 | 1.5% | |
| P | 3096 | 0.8% | |
| C | 2544 | 0.7% | |
| R | 2090 | 0.6% | |
| G | 1461 | 0.4% | |
| E | 1404 | 0.4% | |
| I | 1238 | 0.3% | |
| D | 690 | 0.2% | |
| J | 659 | 0.2% | |
| H | 574 | 0.2% | |
| K | 571 | 0.2% | |
| V | 510 | 0.1% | |
| T | 446 | 0.1% | |
| N | 241 | 0.1% | |
| F | 121 | < 0.1% | |
| O | 119 | < 0.1% | |
| L | 105 | < 0.1% | |
| Y | 66 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| t | 534730 | 28.3% | |
| e | 365867 | 19.4% | |
| a | 192481 | 10.2% | |
| i | 192126 | 10.2% | |
| n | 185160 | 9.8% | |
| d | 180622 | 9.6% | |
| s | 178172 | 9.4% | |
| o | 12975 | 0.7% | |
| c | 9805 | 0.5% | |
| x | 5767 | 0.3% | |
| l | 5676 | 0.3% | |
| u | 5621 | 0.3% | |
| r | 5201 | 0.3% | |
| m | 3272 | 0.2% | |
| p | 2719 | 0.1% | |
| b | 2122 | 0.1% | |
| h | 1930 | 0.1% | |
| y | 1468 | 0.1% | |
| g | 1379 | 0.1% | |
| v | 755 | < 0.1% | |
| w | 201 | < 0.1% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 179910 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| ? | 3393 | 98.1% | |
| & | 66 | 1.9% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 1047 | 100.0% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 119 | 100.0% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 119 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 2265440 | 92.5% | |
| Common | 184654 | 7.5% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| t | 534730 | 23.6% | |
| e | 365867 | 16.1% | |
| a | 192481 | 8.5% | |
| i | 192126 | 8.5% | |
| n | 185160 | 8.2% | |
| d | 180622 | 8.0% | |
| S | 178462 | 7.9% | |
| s | 178172 | 7.9% | |
| U | 177227 | 7.8% | |
| o | 12975 | 0.6% | |
| c | 9805 | 0.4% | |
| M | 5767 | 0.3% | |
| x | 5767 | 0.3% | |
| l | 5676 | 0.3% | |
| u | 5621 | 0.2% | |
| r | 5201 | 0.2% | |
| m | 3272 | 0.1% | |
| P | 3096 | 0.1% | |
| p | 2719 | 0.1% | |
| C | 2544 | 0.1% | |
| b | 2122 | 0.1% | |
| R | 2090 | 0.1% | |
| h | 1930 | 0.1% | |
| y | 1468 | 0.1% | |
| G | 1461 | 0.1% | |
| Other values (16) | 9079 | 0.4% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| - | 179910 | 97.4% | |
| ? | 3393 | 1.8% | |
| 1047 | 0.6% | ||
| ( | 119 | 0.1% | |
| ) | 119 | 0.1% | |
| & | 66 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 2450094 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| t | 534730 | 21.8% | |
| e | 365867 | 14.9% | |
| a | 192481 | 7.9% | |
| i | 192126 | 7.8% | |
| n | 185160 | 7.6% | |
| d | 180622 | 7.4% | |
| - | 179910 | 7.3% | |
| S | 178462 | 7.3% | |
| s | 178172 | 7.3% | |
| U | 177227 | 7.2% | |
| o | 12975 | 0.5% | |
| c | 9805 | 0.4% | |
| M | 5767 | 0.2% | |
| x | 5767 | 0.2% | |
| l | 5676 | 0.2% | |
| u | 5621 | 0.2% | |
| r | 5201 | 0.2% | |
| ? | 3393 | 0.1% | |
| m | 3272 | 0.1% | |
| P | 3096 | 0.1% | |
| p | 2719 | 0.1% | |
| C | 2544 | 0.1% | |
| b | 2122 | 0.1% | |
| R | 2090 | 0.1% | |
| h | 1930 | 0.1% | |
| Other values (22) | 13359 | 0.5% |
citizenship
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Native- Born in the United States | |
|---|---|
| Foreign born- Not a citizen of U S | 13401 |
| Foreign born- U S citizen by naturalization | 5855 |
| Native- Born abroad of American Parent(s) | 1756 |
| Native- Born in Puerto Rico or U S Outlying | 1519 |
| Value | Count | Frequency (%) | |
| Native- Born in the United States | 176992 | 88.7% | |
| Foreign born- Not a citizen of U S | 13401 | 6.7% | |
| Foreign born- U S citizen by naturalization | 5855 | 2.9% | |
| Native- Born abroad of American Parent(s) | 1756 | 0.9% | |
| Native- Born in Puerto Rico or U S Outlying | 1519 | 0.8% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 43 |
|---|---|
| Median length | 33 |
| Mean length | 33.50715456 |
| Min length | 33 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 1034829 | 15.5% | ||
| t | 937396 | 14.0% | |
| e | 754786 | 11.3% | |
| n | 610279 | 9.1% | |
| i | 610042 | 9.1% | |
| a | 395249 | 5.9% | |
| o | 259505 | 3.9% | |
| r | 232940 | 3.5% | |
| - | 199523 | 3.0% | |
| U | 197767 | 3.0% | |
| S | 197767 | 3.0% | |
| N | 193668 | 2.9% | |
| v | 180267 | 2.7% | |
| B | 180267 | 2.7% | |
| d | 178748 | 2.7% | |
| s | 178748 | 2.7% | |
| h | 176992 | 2.6% | |
| b | 26867 | 0.4% | |
| z | 25111 | 0.4% | |
| c | 22531 | 0.3% | |
| g | 20775 | 0.3% | |
| F | 19256 | 0.3% | |
| f | 15157 | 0.2% | |
| u | 8893 | 0.1% | |
| y | 7374 | 0.1% | |
| Other values (8) | 20711 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 4650790 | 69.6% | |
| Space Separator | 1034829 | 15.5% | |
| Uppercase Letter | 796794 | 11.9% | |
| Dash Punctuation | 199523 | 3.0% | |
| Open Punctuation | 1756 | < 0.1% | |
| Close Punctuation | 1756 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| U | 197767 | 24.8% | |
| S | 197767 | 24.8% | |
| N | 193668 | 24.3% | |
| B | 180267 | 22.6% | |
| F | 19256 | 2.4% | |
| P | 3275 | 0.4% | |
| A | 1756 | 0.2% | |
| R | 1519 | 0.2% | |
| O | 1519 | 0.2% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| t | 937396 | 20.2% | |
| e | 754786 | 16.2% | |
| n | 610279 | 13.1% | |
| i | 610042 | 13.1% | |
| a | 395249 | 8.5% | |
| o | 259505 | 5.6% | |
| r | 232940 | 5.0% | |
| v | 180267 | 3.9% | |
| d | 178748 | 3.8% | |
| s | 178748 | 3.8% | |
| h | 176992 | 3.8% | |
| b | 26867 | 0.6% | |
| z | 25111 | 0.5% | |
| c | 22531 | 0.5% | |
| g | 20775 | 0.4% | |
| f | 15157 | 0.3% | |
| u | 8893 | 0.2% | |
| y | 7374 | 0.2% | |
| l | 7374 | 0.2% | |
| m | 1756 | < 0.1% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 199523 | 100.0% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 1034829 | 100.0% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 1756 | 100.0% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 1756 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 5447584 | 81.5% | |
| Common | 1237864 | 18.5% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| t | 937396 | 17.2% | |
| e | 754786 | 13.9% | |
| n | 610279 | 11.2% | |
| i | 610042 | 11.2% | |
| a | 395249 | 7.3% | |
| o | 259505 | 4.8% | |
| r | 232940 | 4.3% | |
| U | 197767 | 3.6% | |
| S | 197767 | 3.6% | |
| N | 193668 | 3.6% | |
| v | 180267 | 3.3% | |
| B | 180267 | 3.3% | |
| d | 178748 | 3.3% | |
| s | 178748 | 3.3% | |
| h | 176992 | 3.2% | |
| b | 26867 | 0.5% | |
| z | 25111 | 0.5% | |
| c | 22531 | 0.4% | |
| g | 20775 | 0.4% | |
| F | 19256 | 0.4% | |
| f | 15157 | 0.3% | |
| u | 8893 | 0.2% | |
| y | 7374 | 0.1% | |
| l | 7374 | 0.1% | |
| P | 3275 | 0.1% | |
| Other values (4) | 6550 | 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1034829 | 83.6% | ||
| - | 199523 | 16.1% | |
| ( | 1756 | 0.1% | |
| ) | 1756 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 6685448 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 1034829 | 15.5% | ||
| t | 937396 | 14.0% | |
| e | 754786 | 11.3% | |
| n | 610279 | 9.1% | |
| i | 610042 | 9.1% | |
| a | 395249 | 5.9% | |
| o | 259505 | 3.9% | |
| r | 232940 | 3.5% | |
| - | 199523 | 3.0% | |
| U | 197767 | 3.0% | |
| S | 197767 | 3.0% | |
| N | 193668 | 2.9% | |
| v | 180267 | 2.7% | |
| B | 180267 | 2.7% | |
| d | 178748 | 2.7% | |
| s | 178748 | 2.7% | |
| h | 176992 | 2.6% | |
| b | 26867 | 0.4% | |
| z | 25111 | 0.4% | |
| c | 22531 | 0.3% | |
| g | 20775 | 0.3% | |
| F | 19256 | 0.3% | |
| f | 15157 | 0.2% | |
| u | 8893 | 0.1% | |
| y | 7374 | 0.1% | |
| Other values (8) | 20711 | 0.3% |
own_business_or_self_employed
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 0 | |
|---|---|
| 2 | 16153 |
| 1 | 2698 |
| Value | Count | Frequency (%) | |
| 0 | 180672 | 90.6% | |
| 2 | 16153 | 8.1% | |
| 1 | 2698 | 1.4% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 180672 | 90.6% | |
| 2 | 16153 | 8.1% | |
| 1 | 2698 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 199523 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 180672 | 90.6% | |
| 2 | 16153 | 8.1% | |
| 1 | 2698 | 1.4% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 199523 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 180672 | 90.6% | |
| 2 | 16153 | 8.1% | |
| 1 | 2698 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 199523 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 180672 | 90.6% | |
| 2 | 16153 | 8.1% | |
| 1 | 2698 | 1.4% |
fill_inc_questionnaire_for_veteran's_admin
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe | |
|---|---|
| No | 1593 |
| Yes | 391 |
| Value | Count | Frequency (%) | |
| Not in universe | 197539 | 99.0% | |
| No | 1593 | 0.8% | |
| Yes | 391 | 0.2% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 14.87269137 |
| Min length | 2 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 395469 | 13.3% | |
| 395078 | 13.3% | ||
| i | 395078 | 13.3% | |
| n | 395078 | 13.3% | |
| N | 199132 | 6.7% | |
| o | 199132 | 6.7% | |
| s | 197930 | 6.7% | |
| t | 197539 | 6.7% | |
| u | 197539 | 6.7% | |
| v | 197539 | 6.7% | |
| r | 197539 | 6.7% | |
| Y | 391 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 2372843 | 80.0% | |
| Space Separator | 395078 | 13.3% | |
| Uppercase Letter | 199523 | 6.7% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 199132 | 99.8% | |
| Y | 391 | 0.2% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 395469 | 16.7% | |
| i | 395078 | 16.6% | |
| n | 395078 | 16.6% | |
| o | 199132 | 8.4% | |
| s | 197930 | 8.3% | |
| t | 197539 | 8.3% | |
| u | 197539 | 8.3% | |
| v | 197539 | 8.3% | |
| r | 197539 | 8.3% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 395078 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 2572366 | 86.7% | |
| Common | 395078 | 13.3% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 395469 | 15.4% | |
| i | 395078 | 15.4% | |
| n | 395078 | 15.4% | |
| N | 199132 | 7.7% | |
| o | 199132 | 7.7% | |
| s | 197930 | 7.7% | |
| t | 197539 | 7.7% | |
| u | 197539 | 7.7% | |
| v | 197539 | 7.7% | |
| r | 197539 | 7.7% | |
| Y | 391 | < 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 395078 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 2967444 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 395469 | 13.3% | |
| 395078 | 13.3% | ||
| i | 395078 | 13.3% | |
| n | 395078 | 13.3% | |
| N | 199132 | 6.7% | |
| o | 199132 | 6.7% | |
| s | 197930 | 6.7% | |
| t | 197539 | 6.7% | |
| u | 197539 | 6.7% | |
| v | 197539 | 6.7% | |
| r | 197539 | 6.7% | |
| Y | 391 | < 0.1% |
veterans_benefits
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 2 | |
|---|---|
| 0 | |
| 1 | 1984 |
| Value | Count | Frequency (%) | |
| 2 | 150130 | 75.2% | |
| 0 | 47409 | 23.8% | |
| 1 | 1984 | 1.0% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 2 | 150130 | 75.2% | |
| 0 | 47409 | 23.8% | |
| 1 | 1984 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 199523 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 150130 | 75.2% | |
| 0 | 47409 | 23.8% | |
| 1 | 1984 | 1.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 199523 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 2 | 150130 | 75.2% | |
| 0 | 47409 | 23.8% | |
| 1 | 1984 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 199523 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 2 | 150130 | 75.2% | |
| 0 | 47409 | 23.8% | |
| 1 | 1984 | 1.0% |
| Distinct | 53 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.17489713 |
|---|---|
| Minimum | 0 |
| Maximum | 52 |
| Zeros | 95983 |
| Zeros (%) | 48.1% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 8 |
| Q3 | 52 |
| 95-th percentile | 52 |
| Maximum | 52 |
| Range | 52 |
| Interquartile range (IQR) | 52 |
Descriptive statistics
| Standard deviation | 24.41148817 |
|---|---|
| Coefficient of variation (CV) | 1.053359073 |
| Kurtosis | -1.863805826 |
| Mean | 23.17489713 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.2101693419 |
| Sum | 4623925 |
| Variance | 595.9207546 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 95983 | 48.1% | |
| 52 | 70314 | 35.2% | |
| 40 | 2790 | 1.4% | |
| 50 | 2304 | 1.2% | |
| 26 | 2268 | 1.1% | |
| 48 | 1806 | 0.9% | |
| 12 | 1780 | 0.9% | |
| 30 | 1378 | 0.7% | |
| 20 | 1330 | 0.7% | |
| 8 | 1126 | 0.6% | |
| 36 | 1108 | 0.6% | |
| 16 | 945 | 0.5% | |
| 32 | 883 | 0.4% | |
| 44 | 845 | 0.4% | |
| 51 | 819 | 0.4% | |
| 24 | 767 | 0.4% | |
| 4 | 757 | 0.4% | |
| 46 | 708 | 0.4% | |
| 35 | 704 | 0.4% | |
| 10 | 694 | 0.3% | |
| 45 | 669 | 0.3% | |
| 6 | 646 | 0.3% | |
| 39 | 602 | 0.3% | |
| 42 | 573 | 0.3% | |
| 28 | 568 | 0.3% | |
| Other values (28) | 7156 | 3.6% |
| Value | Count | Frequency (%) | |
| 0 | 95983 | 48.1% | |
| 1 | 464 | 0.2% | |
| 2 | 458 | 0.2% | |
| 3 | 417 | 0.2% | |
| 4 | 757 | 0.4% | |
| 5 | 309 | 0.2% | |
| 6 | 646 | 0.3% | |
| 7 | 152 | 0.1% | |
| 8 | 1126 | 0.6% | |
| 9 | 239 | 0.1% |
| Value | Count | Frequency (%) | |
| 52 | 70314 | 35.2% | |
| 51 | 819 | 0.4% | |
| 50 | 2304 | 1.2% | |
| 49 | 509 | 0.3% | |
| 48 | 1806 | 0.9% | |
| 47 | 278 | 0.1% | |
| 46 | 708 | 0.4% | |
| 45 | 669 | 0.3% | |
| 44 | 845 | 0.4% | |
| 43 | 374 | 0.2% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 94 | |
|---|---|
| 95 |
| Value | Count | Frequency (%) | |
| 94 | 99827 | 50.0% | |
| 95 | 99696 | 50.0% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 9 | 199523 | 50.0% | |
| 4 | 99827 | 25.0% | |
| 5 | 99696 | 25.0% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 399046 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 9 | 199523 | 50.0% | |
| 4 | 99827 | 25.0% | |
| 5 | 99696 | 25.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 399046 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 9 | 199523 | 50.0% | |
| 4 | 99827 | 25.0% | |
| 5 | 99696 | 25.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 399046 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 9 | 199523 | 50.0% | |
| 4 | 99827 | 25.0% | |
| 5 | 99696 | 25.0% |
income
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| - 50000. | |
|---|---|
| 50000+. | 12382 |
| Value | Count | Frequency (%) | |
| - 50000. | 187141 | 93.8% | |
| 50000+. | 12382 | 6.2% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.937941992 |
| Min length | 7 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 798092 | 50.4% | |
| 5 | 199523 | 12.6% | |
| . | 199523 | 12.6% | |
| - | 187141 | 11.8% | |
| 187141 | 11.8% | ||
| + | 12382 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 997615 | 63.0% | |
| Other Punctuation | 199523 | 12.6% | |
| Dash Punctuation | 187141 | 11.8% | |
| Space Separator | 187141 | 11.8% | |
| Math Symbol | 12382 | 0.8% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 187141 | 100.0% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 187141 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 798092 | 80.0% | |
| 5 | 199523 | 20.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| . | 199523 | 100.0% |
Most frequent Math Symbol characters
| Value | Count | Frequency (%) | |
| + | 12382 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 1583802 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 798092 | 50.4% | |
| 5 | 199523 | 12.6% | |
| . | 199523 | 12.6% | |
| - | 187141 | 11.8% | |
| 187141 | 11.8% | ||
| + | 12382 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1583802 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 798092 | 50.4% | |
| 5 | 199523 | 12.6% | |
| . | 199523 | 12.6% | |
| - | 187141 | 11.8% | |
| 187141 | 11.8% | ||
| + | 12382 | 0.8% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| age | class_of_worker | detailed_industry_recode | detailed_occupation_recode | education | wage_per_hour | enroll_in_edu_inst_last_wk | marital_stat | major_industry_code | major_occupation_code | race | hispanic_origin | sex | member_of_a_labor_union | reason_for_unemployment | full_or_part_time_employment_stat | capital_gains | capital_losses | dividends_from_stocks | tax_filer_stat | region_of_previous_residence | state_of_previous_residence | detailed_household_and_family_stat | detailed_household_summary_in_household | instance_weight | migration_code-change_in_msa | migration_code-change_in_reg | migration_code-move_within_reg | live_in_this_house_1_year_ago | migration_prev_res_in_sunbelt | num_persons_worked_for_employer | family_members_under_18 | country_of_birth_father | country_of_birth_mother | country_of_birth_self | citizenship | own_business_or_self_employed | fill_inc_questionnaire_for_veteran's_admin | veterans_benefits | weeks_worked_in_year | year | income | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 73 | Not in universe | 0 | 0 | High school graduate | 0 | Not in universe | Widowed | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Not in labor force | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Other Rel 18+ ever marr not in subfamily | Other relative of householder | 1700.09 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 0 | 95 | - 50000. |
| 1 | 58 | Self-employed-not incorporated | 4 | 34 | Some college but no degree | 0 | Not in universe | Divorced | Construction | Precision production craft & repair | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Head of household | South | Arkansas | Householder | Householder | 1053.55 | MSA to MSA | Same county | Same county | No | Yes | 1 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 52 | 94 | - 50000. |
| 2 | 18 | Not in universe | 0 | 0 | 10th grade | 0 | High school | Never married | Not in universe or children | Not in universe | Asian or Pacific Islander | All other | Female | Not in universe | Not in universe | Not in labor force | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child 18+ never marr Not in a subfamily | Child 18 or older | 991.95 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Not in universe | Vietnam | Vietnam | Vietnam | Foreign born- Not a citizen of U S | 0 | Not in universe | 2 | 0 | 95 | - 50000. |
| 3 | 9 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 1758.14 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 94 | - 50000. |
| 4 | 10 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 1069.16 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 94 | - 50000. |
| 5 | 48 | Private | 40 | 10 | Some college but no degree | 1200 | Not in universe | Married-civilian spouse present | Entertainment | Professional specialty | Amer Indian Aleut or Eskimo | All other | Female | No | Not in universe | Full-time schedules | 0 | 0 | 0 | Joint both under 65 | Not in universe | Not in universe | Spouse of householder | Spouse of householder | 162.61 | ? | ? | ? | Not in universe under 1 year old | ? | 1 | Not in universe | Philippines | United-States | United-States | Native- Born in the United States | 2 | Not in universe | 2 | 52 | 95 | - 50000. |
| 6 | 42 | Private | 34 | 3 | Bachelors degree(BA AB BS) | 0 | Not in universe | Married-civilian spouse present | Finance insurance and real estate | Executive admin and managerial | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 5178 | 0 | 0 | Joint both under 65 | Not in universe | Not in universe | Householder | Householder | 1535.86 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 6 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 52 | 94 | - 50000. |
| 7 | 28 | Private | 4 | 40 | High school graduate | 0 | Not in universe | Never married | Construction | Handlers equip cleaners etc | White | All other | Female | Not in universe | Job loser - on layoff | Unemployed full-time | 0 | 0 | 0 | Single | Not in universe | Not in universe | Secondary individual | Nonrelative of householder | 898.83 | ? | ? | ? | Not in universe under 1 year old | ? | 4 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 30 | 95 | - 50000. |
| 8 | 47 | Local government | 43 | 26 | Some college but no degree | 876 | Not in universe | Married-civilian spouse present | Education | Adm support including clerical | White | All other | Female | No | Not in universe | Full-time schedules | 0 | 0 | 0 | Joint both under 65 | Not in universe | Not in universe | Spouse of householder | Spouse of householder | 1661.53 | ? | ? | ? | Not in universe under 1 year old | ? | 5 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 52 | 95 | - 50000. |
| 9 | 34 | Private | 4 | 37 | Some college but no degree | 0 | Not in universe | Married-civilian spouse present | Construction | Machine operators assmblrs & inspctrs | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Joint both under 65 | Not in universe | Not in universe | Householder | Householder | 1146.79 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 6 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 52 | 94 | - 50000. |
Last rows
| age | class_of_worker | detailed_industry_recode | detailed_occupation_recode | education | wage_per_hour | enroll_in_edu_inst_last_wk | marital_stat | major_industry_code | major_occupation_code | race | hispanic_origin | sex | member_of_a_labor_union | reason_for_unemployment | full_or_part_time_employment_stat | capital_gains | capital_losses | dividends_from_stocks | tax_filer_stat | region_of_previous_residence | state_of_previous_residence | detailed_household_and_family_stat | detailed_household_summary_in_household | instance_weight | migration_code-change_in_msa | migration_code-change_in_reg | migration_code-move_within_reg | live_in_this_house_1_year_ago | migration_prev_res_in_sunbelt | num_persons_worked_for_employer | family_members_under_18 | country_of_birth_father | country_of_birth_mother | country_of_birth_self | citizenship | own_business_or_self_employed | fill_inc_questionnaire_for_veteran's_admin | veterans_benefits | weeks_worked_in_year | year | income | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 199513 | 57 | Private | 9 | 37 | 9th grade | 0 | Not in universe | Divorced | Manufacturing-durable goods | Machine operators assmblrs & inspctrs | White | Central or South American | Female | Not in universe | Not in universe | Full-time schedules | 0 | 0 | 0 | Single | Not in universe | Not in universe | Householder | Householder | 743.66 | ? | ? | ? | Not in universe under 1 year old | ? | 4 | Not in universe | Dominican-Republic | Dominican-Republic | Dominican-Republic | Foreign born- Not a citizen of U S | 0 | Not in universe | 2 | 52 | 95 | - 50000. |
| 199514 | 51 | Private | 33 | 19 | 10th grade | 0 | Not in universe | Widowed | Retail trade | Sales | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Single | South | North Dakota | Householder | Householder | 1302.34 | NonMSA to nonMSA | Same county | Same county | No | Yes | 6 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 52 | 94 | - 50000. |
| 199515 | 87 | Not in universe | 0 | 0 | High school graduate | 0 | Not in universe | Widowed | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Not in labor force | 0 | 0 | 0 | Single | Not in universe | Not in universe | Nonfamily householder | Householder | 3255.80 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Not in universe | ? | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 0 | 95 | - 50000. |
| 199516 | 3 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | Black | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | South | Utah | Child under 18 of RP of unrel subfamily | Nonrelative of householder | 2733.75 | MSA to MSA | Same county | Same county | No | Yes | 0 | Mother only present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 94 | - 50000. |
| 199517 | 39 | Private | 43 | 26 | Bachelors degree(BA AB BS) | 0 | Not in universe | Never married | Education | Adm support including clerical | Other | Mexican-American | Male | No | Not in universe | Full-time schedules | 6849 | 0 | 0 | Single | Not in universe | Not in universe | Nonfamily householder | Householder | 908.14 | ? | ? | ? | Not in universe under 1 year old | ? | 6 | Not in universe | Mexico | Mexico | Mexico | Foreign born- Not a citizen of U S | 2 | Not in universe | 2 | 52 | 95 | - 50000. |
| 199518 | 87 | Not in universe | 0 | 0 | 7th and 8th grade | 0 | Not in universe | Married-civilian spouse present | Not in universe or children | Not in universe | White | All other | Male | Not in universe | Not in universe | Not in labor force | 0 | 0 | 0 | Joint both 65+ | Not in universe | Not in universe | Householder | Householder | 955.27 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Not in universe | Canada | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 0 | 95 | - 50000. |
| 199519 | 65 | Self-employed-incorporated | 37 | 2 | 11th grade | 0 | Not in universe | Married-civilian spouse present | Business and repair services | Executive admin and managerial | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 6418 | 0 | 9 | Joint one under 65 & one 65+ | Not in universe | Not in universe | Householder | Householder | 687.19 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 1 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 52 | 94 | - 50000. |
| 199520 | 47 | Not in universe | 0 | 0 | Some college but no degree | 0 | Not in universe | Married-civilian spouse present | Not in universe or children | Not in universe | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 157 | Joint both under 65 | Not in universe | Not in universe | Householder | Householder | 1923.03 | ? | ? | ? | Not in universe under 1 year old | ? | 6 | Not in universe | Poland | Poland | Germany | Foreign born- U S citizen by naturalization | 0 | Not in universe | 2 | 52 | 95 | - 50000. |
| 199521 | 16 | Not in universe | 0 | 0 | 10th grade | 0 | High school | Never married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Not in labor force | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 4664.87 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 0 | 95 | - 50000. |
| 199522 | 32 | Private | 42 | 30 | High school graduate | 0 | Not in universe | Never married | Medical except hospital | Other service | Black | All other | Female | No | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Single | Not in universe | Not in universe | Nonfamily householder | Householder | 1830.11 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 6 | Not in universe | ? | ? | ? | Foreign born- Not a citizen of U S | 0 | Not in universe | 2 | 52 | 94 | - 50000. |
Most frequent
| age | class_of_worker | detailed_industry_recode | detailed_occupation_recode | education | wage_per_hour | enroll_in_edu_inst_last_wk | marital_stat | major_industry_code | major_occupation_code | race | hispanic_origin | sex | member_of_a_labor_union | reason_for_unemployment | full_or_part_time_employment_stat | capital_gains | capital_losses | dividends_from_stocks | tax_filer_stat | region_of_previous_residence | state_of_previous_residence | detailed_household_and_family_stat | detailed_household_summary_in_household | instance_weight | migration_code-change_in_msa | migration_code-change_in_reg | migration_code-move_within_reg | live_in_this_house_1_year_ago | migration_prev_res_in_sunbelt | num_persons_worked_for_employer | family_members_under_18 | country_of_birth_father | country_of_birth_mother | country_of_birth_self | citizenship | own_business_or_self_employed | fill_inc_questionnaire_for_veteran's_admin | veterans_benefits | weeks_worked_in_year | year | income | count | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 559 | 3 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 2125.99 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 95 | - 50000. | 6 |
| 1947 | 11 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 1131.62 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 94 | - 50000. | 6 |
| 104 | 0 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 1363.88 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 95 | - 50000. | 5 |
| 358 | 2 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 1182.42 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 95 | - 50000. | 5 |
| 590 | 3 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 966.31 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 95 | - 50000. | 5 |
| 603 | 3 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 1220.24 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 95 | - 50000. | 5 |
| 627 | 3 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 1803.03 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 94 | - 50000. | 5 |
| 881 | 5 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 886.02 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 95 | - 50000. | 5 |
| 1433 | 8 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 1215.87 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 95 | - 50000. | 5 |
| 1453 | 8 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 1979.97 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 95 | - 50000. | 5 |